Express Mail Label No. EV 312710694US 
Date of Deposit: September 15, 2003 

21402-612 (CURA-9 12) 

ZONE 3 NECROSIS ASSOCIATED MARKERS AND METHOD OF USE 

THEREOF 

Related Application 

5 This application claims priority to U.S. Serial No. 60/410,763, filed September 13, 2002. 

The contents of this application is incorporated herein by reference in its entirety. 

Field of the Invention 

The invention relates generally to the identification of genetic markers associated with 
toxicity. 

1 0 Background of the Invention 

Necrotic cell death is a common response of the liver to hepatotoxic agents and represents 
an irreversible form of damage to individual liver cells. While the mechanisms by which 
hepatotoxic agents lead to necrosis remain to be fully understood, progress has been made in 
understanding the biochemical pathways involved. Necrotic cell death occurs when a chemical 

15 or it's metabolite react with critical cellular systems resulting in ion dysregulation, mitochondrial 
dysfunction and oxidative stress [1-3]. The acute disruption of these normal cellular events in 
effect leads to ATP depletion. This loss of energy within the cell distinguishes necrosis from the 
other classification of cell death known as apoptosis. When cells undergo apoptosis or 
programmed cell death, the cell requires an energy level capable of triggering special metabolic, 

20 signal transduction and gene regulation pathways that systematically shut down the cell. 

Necrosis occurs when the ATP levels falls below the threshold required for these processes and 
the cell is driven into a passive state of cellular chaos that culminates in cell death [4]. Thus, 
although these two forms of cell death are distinct, they can share initiating pathways depending 
on the how sharply the ATP levels decline. Such can be seen with the induction of the 

25 mitochondrial permeability transition (MPT), a mechanism that causes mitochondrial failure. 
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The MPT will lead to necrosis if ATP is depleted or apoptosis if there are sufficient amounts 
available to initiate a caspase cascade [5]. 

Chemical insult that produces necrosis of the liver can be either nonzonal or zonal. Zonal 
necrosis is separated into zones 1, 2 and 3 based on the region of the lobule affected. Different 
5 hepatotoxic agents preferentially target specific zones [6]. This research project report 

specifically deals with those agents that produced zone 3 or centrilobular necrosis. This is the 
most commonly affected area of the liver for hepatotoxic agents producing zonal necrosis. 
Zones 1-3 are distinguishable in terms of blood flow, oxygen content, bile flow and ratio of 
intoxication versus detoxification pathways. Factors such as these explain the specificity of 

10 hepatotoxic agents for particular zones. Acetaminophen and carbon tetrachloride (CCL4) are 
examples of agents that produce mainly zone 3 necrosis once they are converted to reactive 
metabolites. This can be attributed to the high degree of regional organization of agent specific 
cytochrome P450's within the liver [7]. The supply of oxygen available to the cell has also been 
shown to be a factor for zone 3 necrosis producing agents. Zone 3 is the region of the liver that 

1 5 is furthest from the arterial blood supply receiving the least supply of oxygen. When CC1 4 is 
metabolized to its reactive metabolite CCI3, the reduction reaction is inhibited by oxygen, 
favoring a necrotic response in the centrilobular area [7]. 

Liver cell necrosis can evoke a range of responses within the liver that depend on the 
severity of insult. These responses range from regeneration of necrotic tissue with restoration of 

20 full liver function to concomitant loss of liver function, liver failure and death [8]. In the 
process, necrosis may trigger the development of other liver diseases. Recurring bouts of 
necrosis and repair may result in disruption of the structure of the liver and result in subacute 
hepatitis, chronic hepatitis or even cirrhosis [9], In this process, inflammatory cells stimulate the 
deposition of collagen around hepatocytes causing alteration in hepatic function and blood flow 

25 [10]. There is also evidence that necrosis may play a role in the induction of early hepatocellular 
carcinoma through compensatory liver regeneration. Diethylnitrosamine and Fumonisin B(l) are 
two examples of compounds that show evidence of producing hepatocellular carcinoma in rats 
through a sequence of events that begin with necrosis [11-13]. Thus the benefit of obtaining 
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marker genes predictive of hepatic zone 3 necrosis stem from its participation in the pathogenesis 
of other liver diseases as well as it being an early indicator of hepatic toxicity. 

Summary of the Invention 

The invention is based in part on the discovery that certain nucleic acids are differentially 
5 expressed in liver cells or liver tissue of animals treated with toxic compounds. These 

differentially expressed nucleic acids include novel sequences and nucleic acids sequences that, 
while previously described, have not heretofore been identified as associated with toxicity and 
are collectively referred to herein as "TOXMARKER nucleic acids" or "TOXMARKER 
polynucleotides" and the corresponding encoded polypeptides are referred to as "TOXMARKER 
10 polypeptides" or "TOXMARKER proteins". The TOXMARKER genes are useful in high 
throughput screening of potential therapeutic compounds for toxicity. 

In on aspect the invention provides methods of predicting the hepatotoxicity of a test 
agent. Hepatotoxicity is predicted by determining the level of expression of a toxicity-associated 
gene in a cell exposed to a test agent. The level of expression of the toxicity-associated gene is 
1 5 compared to the level of expression of the toxicity-associated gene in a control population 

exposed to a control agent. A test agent is predicted to be toxic if an alteration (e.g., increase or 
decrease) in the level of expression in the cell exposed to the test agent compared to the control 
population is identified. 

Also provided by the invention are methods of screening a test agent for inducing changes 
20 in gene expression associated with a toxic agent. An agent is screened for inducing changes in 
gene expression associated with a toxic agent by determining the level of expression of a 
toxicity-associated gene in a cell exposed to a test agent. The level of expression of the toxicity- 
associated gene is compared to the level of expression of the toxicity-associated gene in a control 
population exposed to a control agent. 

25 The alteration is statistically significant. By statistically significant is meant that the 

alteration is greater than what might be expected to happen by change alone. Statistical 
significance is determined by method known in the art. An alteration is statistically significant if 
the p-value is at least 0.05. Preferably, the p-value is 0.04, 0.03, 0.02, 0.01, 0.005, 0.001 or less. 
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By toxicity- associated gene is meant a gene that is characterized by a level of expression 
which differs in a cell exposed to a toxic compound compared to a control population. A 
toxicity-associated gene includes for example TOXMARKER 1-13 1. Preferably, the toxicity- 
associated gene is the genes listed on Table 5. More preferably, the toxicity-associated gene is 
5 TOXMARKER 42, 59, 65, 66, 71, 76, and 97 

A control population is a for example a cell not exposed to a toxic agent. Optionally, the 
control population is exposed to a control agent. A control agent is an agent that does not elicit a 
histology associated with a condition associated with liver toxicity such as Cholestasis; Steatosis; 
Reactive Inflamation; Necrosis, e.g., zone 3, general or multifocal; Genotoxic Carcinogenesis; 
10 Hepatocellular Hypertrophy; Non-Genotoxic Carcinogenesis; Appoptosis and Kupffer Cell 

Aggregation. Exemplary control agents are those listed in Table 1 and Table 2 below. A control 
level is a single expression pattern derived from a single control population or from a plurality of 
expression patterns. For example, the control level can be a database of expression patterns from 
previously tested cells. 

1 5 The test cell is provided in vitro. Alternatively, the test cell is provided ex vivo or in vivo 

from a mammalian subject. The test cell is derived from liver tissue, such as for example a 
hepatocyte. Alternatively, the test cell is a subject derived cell sample. The subject derived 
tissue sample is any tissue from a test subject. 

Expression is determined by for example detecting hybridization, e.g., on a chip, of a 
20 toxicity-associated gene probe to a gene transcript of the test cell. 

The invention also provides a zone 3 necrosis reference expression profile of a gene 
expression level two or more of TOXMARKER 1-132. For example, the reference profile 
contains the expression levels of TOXMARKER 1-132. Alternatively, the reference profile 
contains the expression levels of TOXMARKER genes listed on Table 5. Preferably, the 
25 reference profile contains the expression levels of TOXMARKER 42, 59, 65, 66, 71, 76, and 97 

The invention also provides a kit with a detection reagent which binds to two or more 
TOXMARKER nucleic acid sequences or which binds to a gene product encoded by the nucleic 
acid sequences. Also provided is an array of nucleic acids, e.g. oligonucleotides that binds to two 
or more TOXMARKER nucleic acids. For example, the array contains oligonucleotides that 
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bind TOXMARKER 1-132. Alternatively, the array contains oligonucleotides that bind the 
TOXMARKER genes listed on Table 5. Preferably, the array contains oligonucleotides that bind 
TOXMARKER 42, 59, 65, 66, 71, 76, and 97 Most preferably, the array contains 
oligonucleotides that binds at least five TOXMARKER genes listed one Table 5, where the 
5 collection of TOXMARKER genes predict toxicity to a confidence level of a p-value of at least 
0.05 or less. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or equivalent to those described herein can be 

1 0 used in the practice or testing of the present invention, suitable methods and materials are 

described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 
specification, including definitions, will control In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

1 5 Other features and advantages of the invention will be apparent from the following 

detailed description and claims. 

Brief Description of the Drawing 

FIG. 1 is a schmatic of the hepatotoxicity prediction screening method of the invention. 
FIG. 2 is a illustration of a chart showing the linear discriminant model. 



20 Detailed Description of the Invention 

The present invention is based in part on the discovery of changes in expression patterns 
of multiple nucleic acid sequences in response to compounds known to elicit a histopathologic 
condition (i.e., a pathologic change to the liver visible on examination that causes the liver to 
function less effectively) associated with liver toxicity. The compounds are for example, 
25 compounds listed in Table 1 or 2 below. Examples of histopathologic conditions include, 
Cholestasis; Steatosis; Reactive Inflamation; Necrosis, e.g., zone 3, general or multifocal; 
Genotoxic Carcinogenesis; Hepatocellular Hypertrophy; Non-Genotoxic Carcinogenesis; 
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Appoptosis and Kupffer Cell Aggregation. The histopathological conditions are identified by 
methods know in the art. For example, zone 3 necrosis is identified by choleastasis and 
hypertrophy. The identification of genes that are differentially expressed in response to toxic 
compounds are useful in screening potential therapeutic compositions for toxicity. 

5 The genes whose expression levels are modulated (i.e., increased or decreased) in 

response to exposure to a toxic compound are summarized in Tables 3-5 (see EXAMPLES 4 and 
5) and are collectively referred to herein as "toxicity-associated gene", "TOXMARKER nucleic 
acids" or " TOXMARKER polynucleotides" and the corresponding encoded polypeptides are 
referred to as "TOXMARKER polypeptides" or " TOXMARKER proteins." Unless indicated 
10 otherwise, " TOXMARKER " or "toxicity -associated gene" is meant to refer to any of the 
sequences disclosed herein. 

For a given TOXMARKER sequence, its expression can be measured in the methods 
described herein. For previously described sequences, database accession numbers are provided. 
This information allows for one of ordinary skill in the art to deduce information necessary for 
1 5 detecting and measuring expression of the TOXMARKER nucleic acid sequences. 

General Methods 

The TOXMARKER nucleic acids and encoded polypeptides can be identified using the 
information provided in the EXAMPLES below. In some embodiments, the TOXMARKER 
nucleic acids and polypeptides correspond to the nucleic acids or polypeptides which include the 
20 various sequences (referenced by SEQ ID NOs) disclosed for each TOXMARKER. 

The invention includes providing a test cell population which includes at least one cell 
that is capable of expressing one or more of the sequences TOXMARKER 1-132. By "capable 
of expressing" is meant that the gene is present in an intact form in the cell and can be expressed. 
Expression of one, some, or all of the TOXMARKER sequences is then detected, if present, and, 
25 preferably, measured to yield an expression profile, e.g., subject expression profile or a test cell 
expression profile. By "expression profile" is meant a pattern of the level of expression of at least 
two toxicity-associated genes. 

"Similarity of expression profilers a similarity of expression profile between two 
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samples exists when the linear combination of the genes in the profile has a linear discriminant 
score that is more similar to one of the training classes than the other. Linear discriminant 
analysis (LDA) identifies a linear combination of markers that best separates the defined classes. 
In the training data (i.e., control population) of this invention linear discriminant score could be 
5 determined by the following equation: 

Linear disc, score = aGenel + bGene2 + ... + nGeneN 

where a, b ... n are the coefficients identified by least squares that best separate the 
phenotypes under investigation. Thus, similarity in expression profile is a similarity in gene 
combinations. Interpretation of raw data is difficult since the samples are plotted in more than 3 
10 dimensions, one dimension for each gene, which makes it difficult to visualize the data. LDA 
compresses this information into a single dimension. 

By "toxicity-associated gene" is meant a gene, which the level of expression differs in a 
cell or subject exposed to a known toxic compound as compared to a cell or subject not exposed 
to a toxic compound (i.e., control). Preferably, the TOXMARKER genes 42 (IFNAR-2), 59 
15 (Transaldolase), 65 (Clp-1), 66 (Hex), 71 (cszr_204 152648J 9 1521095), 76 (scrjb- 
aa899865_3), and 97 ( scr_gb-bm986259 J). 

Using sequence information provided by the database entries for the known sequences, or 
the sequence information provided herein for the newly described sequences, expression of the 
TOXMARKER sequences are detected (if present) and measured using techniques well known to 

20 one of ordinary skill in the art. For example, sequences within the sequence database entries 
corresponding to TOXMARKER sequences, or within the sequences disclosed herein, can be 
used to construct probes for detecting TOXMARKER RNA sequences in, e.g., northern blot 
hybridization analyses or methods which specifically, and, preferably, quantitatively amplify 
specific nucleic acid sequences. As another example, the sequences can be used to construct 

25 primers for specifically amplifying the TOXMARKER sequences in, e.g. , amplification-based 
detection methods such as reverse-transcription based polymerase chain reaction. When 
alterations in gene expression are associated with gene amplification or deletion, sequence 
comparisons in test and reference populations can be made by comparing relative amounts of the 
examined DNA sequences in the test and reference cell populations. 
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Expression of the genes disclosed herein can be measured at the RNA level using any 
method known in the art. For example, northern hybridization analysis using probes which 
specifically recognize one or more of these sequences can be used to determine gene expression. 
Alternatively, expression can be measured using reverse-transcription-based PCR assays, e.g., 
5 using primers specific for the differentially expressed sequences. 

Expression is also measured at the protein level, i.e., by measuring the levels of 
polypeptides encoded by the gene products described herein. Such methods are well known in 
the art and include, e.g., immunoassays based on antibodies to proteins encoded by the genes. 

Expression levels of one or more of the TOXMARKER sequences in the test cell 
10 population are then compared to expression levels of the sequences in one or more cells from a 
reference (i.e., control) cell population. If desired, a reference expression profile is generated. A 
reference profile is a single expression pattern derived from a single reference population or from 
a plurality of expression patterns. For example, the reference cell population can be a database of 
expression patterns from previously tested cells for which one of the herein-described parameters 
15 or conditions (e.g. , toxicity) is known. 

The reference profile is obtained from the training data, Training data is a collection of 
data from the in vitro or in vivo samples that were exposed to compounds that produce a known 
pathology, (i.e., pathology present or pathology absent) Profile is defined here to indicate the 
absolute estimate of the expression level of any one TOXMARKER gene fragment (e.g. 
20 Intensity). 

Expression of sequences in test and reference populations of cells are compared using any 
art-recognized method for comparing expression of nucleic acid sequences. For example, 
expression can be compared using GENECALLING® methods as described in US Patent No. 
5,871,697 and in Shimkets et al., Nat. Biotechnol. 17:798-803. 

25 In various embodiments, the expression of one or more sequences encoding genes of 

related function, as listed in Tables 3-5, is compared. In various embodiments, the expression of 
2, 3, 4, 5, 6, 7,8, 9, 10, 15, 20, 25, 35, 40, 50, 100 or all of the sequences represented by 
TOXMARKER 1-132 are measured. If desired, expression of these sequences can be measured 
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along with other sequences whose expression is known to be altered according to one of the 
herein described parameters or conditions. 

The reference cell population includes one or more cells for which the compared 
parameter is known. The compared parameter can be, e.g., toxic agent expression status. By 
5 "toxic agent expression status" is meant that it is known whether the reference cell has had 
contact with a toxic agent. Whether or not comparison of the gene expression profile in the test 
cell population to the reference cell population reveals the presence, or degree, of the measured 
parameter depends on the composition of the reference cell population. For example, if the 
reference cell population is composed of cells that have not been treated with a known toxic 
10 agent, a similar gene expression level in the test cell population and a reference cell population 
indicates the test agent is not a toxic agent. Conversely, if the reference cell population is made 
up of cells that have been treated with a toxic agent, a similar gene expression profile between 
the test cell population and the reference cell population indicates the test agent is a toxic agent. 

In various embodiments, a TOXMARKER sequence in a test cell population is 
1 5 considered comparable in expression level to the expression level of the TOXMARKER 

sequence if its expression level varies within a factor of 2.0, 1.5, or 1.0 fold to the level of the 
TOXMARKER transcript in the reference cell population. In various embodiments, a 
TOXMARKER sequence in a test cell population can be considered altered in levels of 
expression if its expression level varies from the reference cell population by more than 3.0, 4.0, 
20 5.0 or more fold from the expression level of the corresponding TOXMARKER sequence in the 
reference cell population. 

If desired, comparison of differentially expressed sequences between a test cell 
population and a reference cell population can be done with respect to a control nucleic acid 
whose expression is independent of the parameter or condition being measured. Expression 
25 levels of the control nucleic acid in the test and reference nucleic acid can be used to normalize 
signal levels in the compared populations. 

In some embodiments, the test cell population is compared to multiple reference cell 
populations. Each of the multiple reference populations may differ in the known parameter. 
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Thus, a test cell population may be compared to a first reference cell population known to have 
been exposed to a toxic agent, as well as a second reference population known to have not been 
exposed to a toxic agent. 

The test cell population that is exposed to, i.e., contacted with, the test toxic agent can be 
5 any number of cells, i.e. , one or more cells, and can be provided in vitro, in vivo, or ex vivo. The 
test cell is obtained from a bodily fluid, e.g., biological fluid (such as blood, serum, urine, saliva, 
milk, ductal fluid, or tears). For example, the test cell is purified from blood or another tissue, 
i.e., liver tissue. 

In other embodiments, the test cell population can be divided into two or more 
1 0 subpopulations. The subpopulations can be created by dividing the first population of cells to 
create as identical a subpopulation as possible. This will be suitable, in, for example, in vitro or 
ex vivo screening methods. In some embodiments, various subpopulations can be exposed to a 
control agent, and/or a test agent, multiple test agents, or, e.g., varying dosages of one or multiple 
test agents administered together, or in various combinations. 

1 5 Preferably, cells in the reference cell population are derived from a tissue type as similar 

as possible to the test cell, e.g., liver tissue. Alternatively the cells are derived from an 
established cell line. Preferably, the cell is a hepatocyte. In some embodiments, the control cell 
is derived from the same subject as the test cell, e.g., from a region proximal to the region of 
origin of the test cell. In other embodiments, the reference cell population is derived from a 

20 plurality of cells. For example, the reference cell population can be a database of expression 
patterns from previously tested cells for which one of the herein-described parameters or 
conditions (toxic agent expression status) is known. 

The test agent can be a compound not previously described or can be a previously known 
compound but which is not known to be a toxic agent. 

25 The subject is preferably a mammal. The mammal can be, e.g., a human, non-human 

primate, mouse, rat, dog, cat, horse, or cow. 
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Prediction of Toxicity 

In one aspect, the invention provides a method of predicting the toxicity e.g., 
hepatotoxicity of a test agent or identifying a toxic agents, e.g., a hepatotoxic agent. The method 
is an in vivo method. Alternatively, the method is an in vitro method. 

5 Differences in the genetic makeup of individuals can result in differences in their relative 

abilities to metabolize various drugs. Accordingly, the differentially expressed TOXMARKER 
sequences disclosed herein allow for a putative therapeutic or prophylactic agent to be tested in a 
test cell population from a selected subject in order to predict if the agent causes toxicity in the 
subject. 

10 By predicting the toxicity is meant that the test compound is more likely to be hepatotoxic 

that not be hepatotoxic. Hepatotoxicity is predicted by determining the level of expression of a 
toxicity-associated gene in a cell exposed to a test agent. The level of expression of the toxicity- 
associated gene is compared to the level of expression of the toxicity-associated gene in a control 
population exposed to a control agent. A test agent is predicted to be toxic if an alteration (e.g., 

1 5 increase or decrease) in the level of expression in the cell exposed to the test agent compared to 
the control population is identified. 

The toxicity-associated gene is for example TOXMARKER 1-132. Alternatively, the 
toxicity-associated gene is the TOXMARKER genes listed on Table 5. Optionally, the toxicity- 
associated gene is TOXMARKER 42, 59, 65, 66, 71, 76, and 97 The toxicity-associated gene is 
20 a nucleic acid sequences homologous to those listed in Tables 3-5 as TOXMARKER 1-132. The 
sequences need not be identical to sequences including TOXMARKER 1-132, as long as the 
sequence is sufficiently similar that specific hybridization can be detected. Preferably, the cell 
includes sequences that are identical, or nearly identical to those identifying the TOXMARKER 
nucleic acids shown in Tables 3-5. 

25 By hepatotoxicity is meant that that the compound causes a hispathological change in the 

live tissue associate with toxicity. By "toxicity" is meant that the agent is damaging or 
destructive to liver when administered to a subject. Damage to the liver is measured for 
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example, histologically. Hepatotoxicity is determined, for example as described in the examples 
below. 

The cell population is contacted in vitro, or in vivo. Optionally, the cell population is 
contacted ex vivo with the agent or activated form of the agent. 

5 Expression of the nucleic acid sequences in the test cell population is then compared to 

the expression of the nucleic acid sequences in a control population, which is a cell population 
that has not been exposed to the test agent, or, in some embodiments, a cell population exposed 
to the test agent. Comparison can be performed on test and reference samples measured 
concurrently or at temporally distinct times. An example of the latter is the use of compiled 

10 expression information, e.g., a sequence database, which assembles information about expression 
levels of known sequences following administration of various agents. For example, alteration 
of expression levels following administration of test agent can be compared to the expression 
changes observed in the nucleic acid sequences following administration of a control agent. A 
control agent is a compound that elicits the histopathology. Alternatively, the control agent is a. 

1 5 compound that does not elicit the histopathology. Exemplary control compounds are listed in 
Tables 1 and 2. 

An alteration in expression of the nucleic acid sequence in the test cell population 
compared to the expression of the nucleic acid sequence in the control cell population that has 
not been exposed to the test agent indicates the test agent is a toxic agent. 

20 The alteration is statistically significant. By statistically significant is meant that the 

alteration is greater than what might be expected to happen by change alone. Statistical 
significance is determined by method known in the art. For example statistical significance is 
determined by p-value. The p-values is a measure of probability that a difference between groups 
during an experiment happened by chance. (P(z>z 0 b S erved)). For example, a p-value of 0.01 means 

25 that there is a 1 in 100 chance the result occurred by chance. The lower the p-value, the more 
likely it is that the difference between groups was caused by treatment. An alteration is 
statistically significant if the p-value is at least 0.05. Preferably, the p-value is 0.04, 0.03, 0.02, 
0.01,0.005, 0.001 or less. 
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The invention also includes a toxic agent identified according to this screening method. 

The differentially expressed TOXMARKER sequences identified herein also allow for 
the toxicity of a toxic agent to be determined or monitored. In this method, a test cell population 
from a subject is exposed to a test agent, i.e. a. toxic agent. If desired, test cell populations can 
5 be taken from the subject at various time points before, during, or after exposure to the test agent. 
Expression of one or more of the TOXMARKER sequences, e.g., TOXMARKER: 1-132, in the 
cell population is then measured and compared to a control population which includes cells 
whose toxic agent expression status is known. 

Kits 

The invention also includes a TOXMARKER-detection reagent, e.g., nucleic acids that 
specifically identify one or more TOXMARKER nucleic acids by having homologous nucleic 
acid sequences, such as oligonucleotide sequences, complementary to a portion of the 
TOXMARKER nucleic acids or antibodies to proteins encoded by the TOXMARKER nucleic 
acids packaged together in the form of a kit. The oligonucleotides are fragments of the the 
TOXMARKER genes. For example the olignucleitides are 200, 150, 100, 50, 25, 10 or less 
nucleotides in length. The kit may contain in separate containers a nucleic acid or antibody 
(either already bound to a solid matrix or packaged separately with reagents for binding them to 
the matrix) , control formulations (positive and/or negative), and/or a detectable label. 
Instructions (e.g., written, tape, VCR, CD-ROM, etc.) for carrying out the assay may be included 
in the kit. The assay may for example be in the form of a Northern hybridization or a sandwich 
ELIS A as known in the art. 

For example, TOXMARKER detection reagent, is immobilized on a solid matrix such as 
a porous strip to form at least one TOXMARKER detection site. The measurement or detection 
region of the porous strip may include a plurality of sites containing a nucleic acid. A test strip 
25 may also contain sites for negative and/or positive controls. Alternatively, control sites are 
located on a separate strip from the test strip. Optionally, the different detection sites may 
contain different amounts of immobilized nucleic acids, i.e., a higher amount in the first 
detection site and lesser amounts in subsequent sites. Upon the addition of test sample, the 
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number of sites displaying a detectable signal provides a quantitative indication of the amount of 
TOXMARKER present in the sample. The detection sites may be configured in any suitably 
detectable shape and are typically in the shape of a bar or dot spanning the width of a test strip. 

Alternatively, the kit contains a nucleic acid substrate array comprising one or more 
5 nucleic acid sequences. The nucleic acids on the array specifically identify one or more nucleic 
acid sequences represented by TOXMARKER 1-132. In various embodiments, the expression of 
2, 3,4, 5, 6, 7,8, 9, 10, 15, 20, 25, 40 or 50 or more of the sequences represented by 
TOXMARKER 1-132. are identified by virtue of binding to the array. The substrate array can be 
on, e.g., a solid substrate, e.g., a "chip" as described in U.S. Patent No.5,744,305. 

1 0 Arrays and pluralities 

The invention also includes a nucleic acid substrate array comprising one or more nucleic 
acid sequences. The nucleic acids on the array specifically identify one or more nucleic acid 
sequences represented by TOXMARKER 1-132. In various embodiments, the expression of 2, 
3,4, 5, 6, 7,8, 9, 10, 15, 20, 25, 40 or 50 or more of the sequences represented by TOXMARKER 
15 1-132 are identified. 

The nucleic acids in the array can identify the enumerated nucleic acids by, e.g., having 
homologous nucleic acid sequences, such as oligonucleotide sequences, complementary to a 
portion of the recited nucleic acids. The substrate array can be on, e.g., a solid substrate, e.g., a 
"chip" as described in U.S. Patent No.5,744,305. 

20 The invention also includes an isolated plurality (i.e., a mixture of two or more nucleic 

acids) of nucleic acid sequences. The nucleic acid sequence can be in a liquid phase or a solid 
phase, e.g., immobilized on a solid support such as a nitrocellulose membrane. The plurality 
typically includes one or more of the nucleic acid sequences represented by TOXMARKER 1- 
132. In various embodiments, the plurality includes 2, 3,4, 5, 6, 7,8, 9, 10, 15, 20, 25, 40 or 50 or 

25 more of the sequences represented by TOXMARKER 1-132. 
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Nucleic Acids 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
TOXMARKER proteins or biologically active portions thereof, as well as nucleic acid fragments 
sufficient for use as hybridization probes to identify TOXMARKER-encoding nucleic acids (e.g., 
5 TOXMARKER mRNA) and fragments for use as PCR primers for the amplification or mutation 
of TOXMARKER nucleic acid molecules. As used herein, the term "nucleic acid molecule" is 
intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA molecules (e.g., 
mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and derivatives, 
fragments and homologs thereof. The nucleic acid molecule can be single-stranded or 
1 0 double-stranded, but preferably is double-stranded DNA. 

"Probes" refer to nucleic acid sequences of variable length, preferably between at least 
about 10 nucleotides (nt), 100 nt, or as many as about, e.g., 6,000 nt, depending on use. Probes 
are used in the detection of identical, similar, or complementary nucleic acid sequences. Longer 
length probes are usually obtained from a natural or recombinant source, are highly specific and 
1 5 much slower to hybridize than oligomers. Probes may be single- or double-stranded and designed 
to have specificity in PCR, membrane-based hybridization technologies, or ELISA-like 
technologies. 

An "isolated" nucleic acid molecule is one that is separated from other nucleic acid 
molecules which are present in the natural source of the nucleic acid. Preferably, an "isolated" 

20 nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 
nucleic acid is derived. For example, in various embodiments, the isolated TOXMARKER 
nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell 

25 from which the nucleic acid is derived (e.g., testis, lung, B-cells). Moreover, an "isolated" nucleic 
acid molecule, such as a cDNA molecule, can be substantially free of other cellular material or 
culture medium when produced by recombinant techniques, or of chemical precursors or other 
chemicals when chemically synthesized. 
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A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO: 1-171 or a complement of any of these nucleotide 
sequences, can be isolated using standard molecular biology techniques and the sequence 
information provided herein. Using all or a portion of the nucleic acid sequences of SEQ ID NO: 
5 1-171 as a hybridization probe, TOXMARKER molecules can be isolated using standard 

hybridization and cloning techniques (e.g., as described in Sambrook et aL, (eds.), Molecular 
Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989; and Ausubel, et ai, (eds.), CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 
John Wiley & Sons, New York, NY, 1993.) 

10 A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 

genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
TOXMARKER nucleotide sequences can be prepared by standard synthetic techniques, e.g., 

1 5 using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide residues, 
which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction. 
A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 

20 complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions of 
a nucleic acid sequence having about 10 nt, 50 nt, or 100 nt in length, preferably about 15 nt to 
30 nt in length. In one embodiment, an oligonucleotide comprising a nucleic acid molecule less 
than 100 nt in length would further comprise at lease 6 contiguous nucleotides of SEQ ID NO: 1- 
171, or a complement thereof. Oligonucleotides may be chemically synthesized and may be used 

25 as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NO: 1- 
171. In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NO: 1- 
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171, or a portion of this nucleotide sequence. A nucleic acid molecule that is complementary to 
the nucleotide sequence shown in SEQ ID NO: 1-171 is one that is sufficiently complementary to 
the nucleotide sequence shown in SEQ ID NO: 1-171 that it can hydrogen bond with little or no 
mismatches to the nucleotide sequence shown in SEQ ID NO: 1-171, thereby forming a stable 
5 duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 
pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, Von der 
10 Waals, hydrophobic interactions, etc. A physical interaction can be either direct or indirect. 
Indirect interactions may be through or due to the effects of another polypeptide or compound. 
Direct binding refers to interactions that do not take place through, or due to, the effect of another 
polypeptide or compound, but instead are without other substantial chemical intermediates. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion of the 
1 5 nucleic acid sequence of SEQ ID NO: 1 - 1 7 1 , e.g. , a fragment that can be used as a probe or 
primer or a fragment encoding a biologically active portion of TOXMARKER. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization 
in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 

20 respectively, and are at most some portion less than a full length sequence. Fragments may be 
derived from any contiguous portion of a nucleic acid or amino acid sequence of choice. 
Derivatives are nucleic acid sequences or amino acid sequences formed from the native 
compounds either directly or by modification or partial substitution. Analogs are nucleic acid 
sequences or amino acid sequences that have a structure similar to, but not identical to, the native 

25 compound but differs from it in respect to certain components or side chains. Analogs may be 
synthetic or from a different evolutionary origin and may have a similar or opposite metabolic 
activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences of 
a particular gene that are derived from different species. 
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Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules 
comprising regions that are substantially homologous to the nucleic acids or proteins of the 
invention, in various embodiments, by at least about 30%, 50%, 70%, 80%, or 95% identity (with 
a preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
when compared to an aligned sequence in which the alignment is done by a computer homology 
program known in the art, or whose encoding nucleic acid is capable of hybridizing to the 
complement of a sequence encoding the aforementioned proteins under stringent, moderately 
stringent, or low stringent conditions. See e.g. Ausubel, et ah, Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those sequences 
1 5 coding for isoforms of TOXMARKER polypeptide. Isoforms can be expressed in different 
tissues of the same organism as a result of, for example, alternative splicing of RNA. 
Alternatively, isoforms can be encoded by different genes. In the present invention, homologous 
nucleotide sequences include nucleotide sequences encoding for a TOXMARKER polypeptide of 
species other than humans, including, but not limited to, mammals, and thus can include, e.g., 
20 mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide sequences 
also include, but are not limited to, naturally occurring allelic variations and mutations of the 
nucleotide sequences set forth herein. A homologous nucleotide sequence does not, however, 
include the nucleotide sequence encoding human TOXMARKER protein. Homologous nucleic 
acid sequences include those nucleic acid sequences that encode conservative amino acid 
25 substitutions (see below) in SEQ ID NO: 1-171, as well as a polypeptide having TOXMARKER 
activity. Biological activities of the TOXMARKER proteins are described below. A homologous 
amino acid sequence does not encode the amino acid sequence of a human TOXMARKER 
polypeptide. 
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An TOXMARKER polypeptide is encoded by the open reading frame ("ORF") of a 
TOXMARKER nucleic acid. An "open reading frame" ("ORF") corresponds to a nucleotide 
sequence that could potentially be translated into a polypeptide. A stretch of nucleic acids 
comprising an ORF is uninterrupted by a stop codon. An ORF that represents the coding 
5 sequence for a full protein begins with an ATG "start" codon and terminates with one of the three 
"stop" codons, namely, TAA, TAG, or TGA. For the purposes of this invention, an ORF may be 
any part of a coding sequence, with or without a start codon, a stop codon, or both. For an ORF 
to be considered as a good candidate for coding for a bona fide cellular protein, a minimum size 
requirement is often set, for example, a stretch of DNA that would encode a protein of 50 amino 
10 acids or more. 

The nucleotide sequence determined from the cloning of the human TOXMARKER gene 
allows for the generation of probes and primers designed for use in identifying and/or cloning 
TOXMARKER homologues in other cell types, e.g. from other tissues, as well as 
TOXMARKER homologues from other mammals. The probe/primer typically comprises 
1 5 substantially purified oligonucleotide. The oligonucleotide typically comprises a region of 

nucleotide sequence that hybridizes under stringent conditions to at least about 12, 25, 50, 100, 
150, 200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence of SEQ ID NO: 1- 
171, or an anti-sense strand nucleotide sequence of SEQ ID NO: 1-171 or of a naturally occurring 
mutant of SEQ ED NO: 1-171. 

20 Probes based on the human TOXMARKER nucleotide sequence can be used to detect 

transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g. the label group can 
be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can 
be used as a part of a diagnostic test kit for identifying cells or tissue which misexpress a 

25 TOXMARKER protein, such as by measuring a level of a TOXMARKER-encoding nucleic acid 
in a sample of cells from a subject e.g., detecting TOXMARKER mRNA levels or determining 
whether a genomic TOXMARKER gene has been mutated or deleted. 

"A polypeptide having a biologically active portion of TOXMARKER" refers to 
polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
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polypeptide of the present invention, including mature forms, as measured in a particular 
biological assay, with or without dose dependency. A nucleic acid fragment encoding a 
"biologically active portion of TOXMARKER" can be prepared by isolating a portion of SEQ ID 
NO: 1-171 that encodes a polypeptide having a TOXMARKER biological activity (the biological 
5 activities of the TOXMARKER proteins are described below), expressing the encoded portion of 
TOXMARKER protein (e.g., by recombinant expression in vitro) and assessing the activity of 
the encoded portion of TOXMARKER. 

TOXMARKER variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
10 sequence shown in SEQ ID NO: 1-171 due to degeneracy of the genetic code and thus encode the 
same TOXMARKER protein as that encoded by the nucleotide sequence shown in SEQ ID NO: 
1-171. 

In addition to the human TOXMARKER nucleotide sequence shown in SEQ ID NO: 1- 
171 it will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead 

1 5 to changes in the amino acid sequences of TOXMARKER may exist within a population (e.g., 
the human population). Such genetic polymorphism in the TOXMARKER gene may exist among 
individuals within a population due to natural allelic variation. As used herein, the terms "gene" 
and "recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
encoding a TOXMARKER protein, preferably a mammalian TOXMARKER protein. Such 

20 natural allelic variations can typically result in 1-5% variance in the nucleotide sequence of the 
TOXMARKER gene. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in TOXMARKER that are the result of natural allelic variation and that do not 
alter the functional activity of TOXMARKER are intended to be within the scope of the 
invention. 

25 Moreover, nucleic acid molecules encoding TOXMARKER proteins from other species, 

and thus that have a nucleotide sequence that differs from the human sequence of SEQ ID NO: 1- 
171 are intended to be within the scope of the invention. Nucleic acid molecules corresponding 
to natural allelic variants and homologues of the TOXMARKER cDNAs of the invention can be 
isolated based on their homology to the human TOXMARKER nucleic acids disclosed herein 
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using the human cDNAs, or a portion thereof, as a hybridization probe according to standard 
hybridization techniques under stringent hybridization conditions. For example, a soluble human 
TOXMARKER cDNA can be isolated based on its homology to human membrane-bound 
TOXMARKER. Likewise, a membrane-bound human TOXMARKER cDNA can be isolated 
5 based on its homology to soluble human TOXMARKER. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention is 
at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO: 1-171. In another embodiment, the 
nucleic acid is at least 10, 25, 50, 100, 250, 500, 750, lOOOor 1250 nucleotides in length. In 
10 another embodiment, an isolated nucleic acid molecule of the invention hybridizes to the coding 
region. As used herein, the term "hybridizes under stringent conditions" is intended to describe 
conditions for hybridization and washing under which nucleotide sequences at least 60% 
homologous to each other typically remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding TOXMARKER proteins derived from species 
1 5 other than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or 
high stringency hybridization with all or a portion of the particular human sequence as a probe 
using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions under 
which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other 

20 sequences. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures than shorter 
sequences. Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of 

25 the probes complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied 
at equilibrium. Typically, stringent conditions will be those in which the salt concentration is less 
than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at pH 7.0 
to 8.3 and the temperature is at least about 30°C for short probes, primers or oligonucleotides 
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(e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and oligonucleotides. 
Stringent conditions may also be achieved with the addition of destabilizing agents, such as 
formamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel et 
5 al, (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 
85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each 
other. A non-limiting example of stringent hybridization conditions are hybridization in a high 
salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 

10 Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 
more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to the sequence of SEQ ID NO: 1-171 
corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 

15 nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO: 1-171 , or fragments, analogs or 
derivatives thereof, under conditions of moderate stringency is provided. A non-limiting example 
of moderate stringency hybridization conditions are hybridization in 6X SSC, 5X Denhardt's 
20 solution, 0.5% SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or 
more washes in IX SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may 
be used are well-known in the art. See, e.g., Ausubel et al (eds.), 1993, Current Protocols in 
Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY. 

25 In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 

comprising the nucleotide sequence of SEQ ID NO: 1-171, or fragments, analogs or derivatives 
thereof, under conditions of low stringency, is provided. A non-limiting example of low 
stringency hybridization conditions are hybridization in 35% formamide, 5X SSC, 50 mM 
Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured 
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salmon sperm DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 
2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of 

TRA 1832808vl 

low stringency that may be used are well known in the art (e.g., as employed for cross-species 
hybridizations). See, e.g., Ausubel et al. (eds.), 1993, Current Protocols in Molecular 
5 Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer and Expression, A 
Laboratory Manual, Stockton Press, NY; Shilo and Weinberg, 1981, Proc Natl Acad Sci USA 
78: 6789-6792. 

Examples 

Example 1: Induction of Zone 3 Necrosis in vivo 

10 Over 100 compounds were chosen based on evidence that they elicit one of eleven 

selected histopathology subtypes. The criteria for inclusion of a compound into the nongenotoxic 
carcinogens mode included evidence of parenchymal changes and an increase in mitosis in vivo. 
Compounds assigned to this group must also have strong historical documentation. Compounds 
from other pathology modes were not added to this histopathology subtype. The compounds 

1 5 included in nongenotoxic carcinogenesis can been seen in Table 1 . Each compound was 

delivered orally on a daily basis at a high dose (tox dose) and a 1/10 low dose (mode dose) for up 
to 14 days. Five male rats/dose/time were randomly assigned to sacrifice on days 1, 3, 7, and 14. 
In order to best identify genes characteristic of the histopathology subtype, total RNA for all rat 
livers from a given dose time point were pooled and converted to mRNA and cDNA for 

20 GeneCalling®. In GeneCalling, the cDNA is cut with a battery of restriction enzyme pairs in 
different combinations followed by amplification by PCR using specific primers linked to 
specific adaptors. After gel electrophoresis, the resulting fragments are identified based on the 
inherent information in the cDNA fragment: The flanking restriction site sequences on the ends, 
the size of the fragment and the species (and sometimes the tissue) origin of the DNA. This 

25 information is used to query public and proprietary databases. The fragments that do not match 
any sequences in the database are isolated, sequenced and identified as novel. 

Table 1: Zone 3 Necrosis in vivo 



Compound 


V ehicie 


Dose Concentration Time Points Pathology 


Pathology 






Present 


Absent 
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1,3-Dibromobenzene , 


Corn Oil 


High 


600 mg/kg/d 


1,3,7,14 d 


1 d 




1,3-Dibromobenzene 


Corn Oil 


Low 


60 mg/kg/d 


1,3,7,14 d 






1,4-dichlorobenzene 


Com Oil 


High 


300 mg/kg/d 


1,3,7,14 d 






1,4-dichlorobenzene 


Com Oil 


Low 


30 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


17a-Ethynyl-19- 
nortestosterone 


Com Oil 


High 


30 mg/kg/d 


1,3,7,14 d 






17a-Ethynyl-19- 
nortestosterone 


Com Oil 


Low 


3 mg/kg/d 


1,3,7,14 d 






2,4-diaminotoluene 


Methylcellulose 


High 


15 mg/kg/d 


1,3,7,14 d 






2,4-diaminotoluene 


Methylcellulose 


Low 


1 .5 mg/kg/d 


1,3,7,14 d 






2-acetylaminofluorene 


Methylcellulose 


High 


12 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


2 -acetylaminofluorene 


Methylcellulose 


Low 


1.2 mg/kg/d 


1,3,7,14 d 






2-acetylaminofluorene 


Methylcellulose 


High 


120 mg/kg(ip) 


6,12,24,48 h 






2 -acetylaminofluorene 


Methylcellulose 


Low 


12mg/kg(ip) 


6,12,24,48 h 






2-nitrofluorene 


Com oil 


High 


44 mg/kg/d 


1,3,7,14 d 






2-nitrofluorene 


Com oil 


Low 


4.4 mg/kg/d 


1,3,7,14 d 






3-methyl-4- 

(dimethylamino) 

azobenzene 


Methylcellulose 


High 


36 mg/kg/d 


1,3,7,14 d 






3-methyl-4- 

(dimethylamino) 

azobenzene 


Methylcellulose 


Low 


3.6 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


3-methylcholanthrene 


Com Oil 


High 


25 mg/kg/d (ip) 


1,3,7,14 d 






3-methylcholanthrene 


Com Oil 


Low 


2.5 mg/kg/d (ip) 


1,3,7,14 d 




1,3,7,14 d 


Acetamide 


Methylcellulose 


High 


3000 mg/kg/d 


1,3,7,14 d 






Acetamide 


Methylcellulose 


Low 


300mg/kg/d 


1,3,7,14 d 






Acetaminophen 


Saline 


High 


4.25 g/kg(po) 


6,12,24,48 h 


24,48 h 




Acetaminophen 


Saline 


Low 


425 mg/kg (po) 


6,12,24,48 h 






Aflatoxin Bl 


Methylcellulose 


High 


0.24 mg/kg/d 


1,3,7,14 d 






AflatoxinBl 


Methylcellulose 


Low 


0.024 mg/kg/d 


1,3,7,14 d 






Allyl Alcohol 


Methylcellulose 


High 


36 mg/kg/d 


1,3,7,14 d 






Allyl Alcohol 


Methylcellulose 


Low 


3.6 mg/kg/d 


1,3,7,14 d 






Allyl Formate 


Com oil 


High 


94.8 mg/kg (ip) 


3, 6, 12,24h 






Allyl Formate 


Com oil 


Low 


9.48 mg/kg (ip) 


3, 6, 12,24 h 






Amiodarone 


Methylcellulose 


High 


500 mg/kg/d 


1,3,7,14 d 






Amiodarone 


Methylcellulose 


Low 


50 mg/kg/d 


1,3,7,14 d 






ANIT 


Com Oil 


High 


60 mg/kg/d 


1,3,7,14 d 






ANIT 


Com Oil 


Low 


6 mg/kg/d 


1,3,7,14 d 
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Azaserine 


Saline 


High 


100mg/kg(ip) 


1,3,7,14 d 


1,3 d 




Azaserine 


Saline 


Low 


lOmg/kg (ip) 


1,3,7,14 d 






BCNU 


Com Oil 


High 


20 mg/kg/d 


1,3,7,14 d 






BCNU 


Com Oil 


Low 


2 mg/kg/d 


1,3,7,14 d 






BHT 


Com Oil 


High 


500 mg/kg/d 


1,3,7,14 d 






BHT 


Com Oil 


Low 


50 mg/kg/d 


1,3,7,14 d 






Bromobenzene 


Saline 


High 


1200mg/kg(ip) 


6,12,24,48 h 


12,48 h 




Bromobenzene 


Saline 


Low 


120mg/kg (ip) 


6,12,24,48 h 






C.I. Direct Black 


Com oil 


High 


146 mg/kg/d 


1,3,7,14 d 






C.I. Direct Black 


Com oil 


Low 


14.6 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Cadmium chloride 


Saline 


^igh 


3.9mg/kg(iv) 


6,12,24,48 h 






Cadmium chloride 


Saline 


Low 


0.39 mg/kg (iv) 


6,12,24,48 h 






Carbamazepine 


Methylcellulose 


High 


200 mg/kg/d 


1,3,7,14 d 






Carbamazepine 


Methylcellulose 


Low 


20 mg/kg/d 


1,3,7,14 d 






CC1 4 


Com Oil 


High 


50 mg/kg/d 


1,3,7,14 d 


7d 




ecu 


Methylcellulose 


High 


956 mg/kg (ip) 


6,12,24,48 h 


6,12,48 h 




CCI4 


Com Oil 


Low 


5 mg/kg/d 


1,3,7,14 d 








lvicinyitciiuiuac 


Low 




6 1 2 24 48 h 






CCNU 


Com Oil 


High 


20 mg/kg/d 


1,3,7,14 d 






CCNU 


Com Oil 


Low 


2 mg/kg/d 


1,3,7,14 d 






Cefuroxime 


Methylcellulose 


Safe 


125 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Cerium (III) Chloride 


Saline 


High 


10 mg/kg (iv) 


1,2,3,4 d 


1,2,3,4 d 




Cerium (III) Chloride 


Saline 


Low 


1 mg/kg (iv) 


1,2,3,4 d 






Chlordane 


Com Oil 


High 


25 mg/kg/d 


1,3,7,14 d 






Chlordane 


r*r\m Oil 




0 5 mcr/ka/H 


1 3 7 14 d 




1,3,7,14 d 


L-iproiiurdie 


r nrn Oil 


Hi ah 


1 \J I 1 Vgf fc£f U 


1,3,7,14 d 






L-ipronoraie 




Low 




1 1 7 14 d 






Ciprofloxacin 


Methylcellulose 


Safe 


ACS mrr/Vtr/H 

*tu mg/Kg/u 


1 3 7 14 d 




1,3,7,14 d 


(J011 orate 


Methylcellulose 


Hign 


jk)\j mg/Kg/a 








Clofibrate 


Methylcellulose 


Low 


j\j mg/Kg/Q 


l, j,/, 14 a 






Clonidine 


Methylcellulose 


Safe 


0.1 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Colchicine 


Saline 


High 


5 mg/kg x 2 (ip) 


6,12,24,48 h 






Colchicine 


Saline 


Low 


0.5 mg/kg x 2 (ip) 


6,12,24,48 h 






Concanavalin A 


Saline 


High 


20 mg/kg (iv) 


6,12,24,48 h 
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Concanavalin A 


Saline 


Low 


2 mg/kg (iv) 


6,12,24,48 h 






Corn Oil 




Control 




1,3,7,14 d 




1,3,7,14 d 


Com Oil (ip) 




Control 




1,3,7,14 d 




1,3,7,14 d 


Corn Oil (ip) 




Control 




3,6,12,24 h 




3,6,12,24 h 


Coumarin 


Corn Oil 


High 


150 mg/kg/d 


1,3,7,14 d 


3d 




Coumarin 


Com Oil 


Low 


15mg/kg/d 


1,3,7,14 d 






CTFT 


Com Oil 


High 


1 g/kg/d 


1,3,7,14 d 






CTFT 


Com Oil 


Low 


100 mg/kg/d 


1,3,7,14 d 






Cyclosporine A 


Com Oil 


high 


50 mg/kg/day 


1,3,7,14 d 






Cyclosporin A 


Com Oil 


Low 


5 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Cyproterone acetate 


Com Oil 


High 


100 mg/kg/d 


1,3,7,14 d 






Cyproterone acetate 


Com Oil 


Low 


10 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Dehydroepiandrosterone 


Methylcellulose 


fligh 


600 mg/kg/d 


1,3,7,14 d 






Dehydroepiandrosterone 


Methylcellulose 


Low 


60 mg/kg/d 


1,3,7,14 d 






Deoxycholic Acid 


Methylcellulose 


High 


300 mg/kg/d 


1,3,7,14 d 






Deoxycholic Acid 


Methylcellulose 


Low 


30 mg/kg/d 


1,3,7,14 d 






Dexamethasone 


Com oil 


|High 


50 mg/kg/d (ip) 


1,3,7,14 d 






Dexamethasone 


Com oil 


Low 


5 mg/kg/d (ip) 






1 'X 1 1 & A 


D-galactosamine 


Saline 


High 


500 mg/kg (ip) 


6,12,24,48 h 






D-galactosamine 


Saline 


Low 


50 mg/kg (ip) 


6,12,24,48 h 






Di(2-Ethylhexyl) 
phthalate 


Methylcellulose 


High 


12UU mg/Kg/a 


1 1 7 1 A A 






Dif2-Ethvlhexvtt 
phthalate 


Methylcellulose 


Low 


120 mg/kg/d 


1,3,7,14 d 






Dibutyltin chloride 


Com oil 


High 


20 mg/kg/d 


1,3,7,14 d 






Dibutyltin chloride 


Com oil 


Low 


2 mg/kg/d 


1,3,7,14 d 






Dichloropropane 


Com oil 


High 


1000 mg/kg/d 


1,3,7,14 d 


1,3 d 




Dichloropropane 


Com oil 


Low 


100 mg/kg/d 


1,3,7,14 d 






Diethylnitrosamine 


Saline 


High 


150 mg/kg (ip) 


1,3,7,14 d 


1,3,7 d 




Diethylnitrosamine 


Saline 


Low 


15 mg/kg (ip) 


1,3,7,14 d 






Diethylstilbestrol 


Methylcellulose 


High 


10 mg/kg/d 


1,3,7,14 d 






Diethylstilbestrol 


Methylcellulose 


Low 


1 mg/kg/d 


1,3,7,14 d 






Dimethylformamide 


Saline 


High 


850 mg/kg (ip) 


6,12,24,48 h 


48 h 




Dimethylformamide 


Saline 


Low 


85 mg/kg (ip) 


6,12,24,48 h 






Dimethylnitrosamine 


Com Oil 


High 


4 mg/kg/d 


1,3,7,14 d 


7,14 d 




Dimethylnitrosamine 


Com Oil 


Low 


0.4 mg/kg/d 


1,3,7,14 d 
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Diouat 


Saline 


High 


36 me/ke (m\ 


6,12,24,48 h 






Diauat 


Saline 


Low 


3 6 me/ke (in) 


6,12,24,48 h 




6,12,24,48 h 


Disopyramide 


M eth vlcellulose 


Safe 


20 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Endotoxin 


Saline 


High 


8 mg/kg (ip) 


6,12,24,48 h 






Endotoxin 


Saline 


Low 


0.8 mg/kg (ip) 


6,12,24,48 h 






Erythromycin Estolate 




High 


800 mg/kg/d 


1,3,7,14 d 






Erythromycin Estolate 
(EE) 


Methylcellulose 


Low 


80 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Ethanol 


Corn Oil 


High 


15g/kg/d 


1,3,7,14 d 






Ethanol 


Corn Oil 


Low 


1.5g/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Ethinylestradiol 


Com Oil 


High 


15 mg/kg/d 


1,3,7,14 d 






Ethinylestradiol 


Com Oil 


Low 


1 .5 mg/kg/d 


1,3,7,14 d 

/ 






Ethionine 


Metnylcellulose 


High 


OrtA mo/I/-**//') 

zoo mg/Kg/a 


1 1 H 1 A A 1 

1,3, /,I4 a 


1 1 A 

1,3 u 




CU I1U11I11C 


MpthvIppllnlnQp 


Low 


20 me/ke/d 


1 3 7 14 d 






Ethosuximide 


Methylcellulose 


Safe 


100 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Ethylenethiourea 


Saline 


High 


920 mg/kg (po) 


6,12,24,48 h 






Ethylenethiourea 


Saline 


Low 


92 mg/kg (po) 


6,12,24,48 h 






Fenarimol 


Com Oil 


High 


62.5 mg/kg/d 


1,3,7,14 d 






Fenarimol 


Com Oil 


Low 


6.25 mg/kg/d 


1,3,7,14 d 






Fenbendazole 


Methylcellulose 


High 


3000 mg/kg/d 


1,3,7,14 d 






Fenbendazole 


Methylcellulose 


Low 


300 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Furan 


Com Oil 


High 


15 mg/kg/d 


1,3,7,14 d 








Com Oil 


Low 


1 .5 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


HvHra7i'np 


Methy lcel lu lose 


High 


100 mg/kg/d 


1,3,7,14 d 








MpthvlcelliilfKe 


Low 


1 0 mg/kg/d 


1,3,7,14 d 






Ibuprofen 


Methylcellulose 


Safe 


94 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Ketoconazole 


Methylcellulose 


High 


100 mg/kg/d 


1,3,7,14 d 






Ketoconazole 


Methylcellulose 


Low 


10 mg/kg/d 


1,3,7,14 d 




1,3 d 


Lead nitrate 


Saline 


High 


33 g/kg (iv) 


1,3,7,14 d 






Lead nitrate 


Saline 


Low 


1 1 n/l/n /!.r\ 

3 J g/Kg(lV) 


\ 17 1/1/1 

1,3,/, 14 a 






Methapyrilene 


Methylcellulose 


High 


60 mg/kg/d 


1,3,7,14 d 






Methapyrilene 


Methylcellulose 


Low 


6 mg/kg/d 


1,3,7,14 d 






Methionine-choline 
deficient diet 


In feed 


High 


60 g/kg/d 


1,3,7,14 d 






Methyl Carbamate 


Methylcellulose 


High 


400 mg/kg/d 


1,3,7,14 d 
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Methyl Carbamate 


Methylcellulose 


Low 


40 mg/kg/d 


1,3,7,14 d 






Methylcellulose 




Control 




1,3,7,14 d 




1,3,7,14 d 


Methylcellulose (ip) 




Control 




6,12,24,48 h 






Methylenedianiline 


Corn Oil 


High 


50 mg/kg/d 


1,3,7,14 d 






Methylenedianiline 


Corn Oil 


Low 


5 mg/kg/d 


1,3,7,14 d 






Methyleugenol 


Methylcellulose 


High 


1000 mg/kg/d 


1,3,7,14 d 






Methyleugenol 


Methylcellulose 


Low 


100 mg/kg/d 


1,3,7,14 d 






Methyl-tert-butyl ether 


Com Oil 


High 


1500 mg/kg/d 


1,3,7,14 d 






Methyl-tert-butyl ether 


Corn Oil 


Low 


150 mg/kg/d 


1,3,7,14 d 






Microcystin-LR 


Saline 


High 


20u-g/kg(iv) 


6,12,24,48 h 






Microcystin-LR 


Saline 


Low 


2 u,g/kg (iv) 


6,12,24,48 h 






Mirex 


Com Oil 


High 


10 mg/kg/d 


1,3,7,14 d 






Mirex 


Com Oil 


Low 


1 mg/kg/d 


1,3,7,14 d 






Molybdenum 


Methylcellulose 


High 


500 mg/kg/d 


1,3,7,14 d 


3d 




Molybdenum 


Methylcellulose 


Low 


50 mg/kg/d 


1,3,7,14 d 






Monocrotaline 


H 2 0 


High 


160 mg/kg/d 


1,3,7,14 d 


1,3 d 




Monorrnta line 


H 2 0 


Low 


1 6 mg/kg/d 


1,3,7,14 d 






N-diethvltiitrfwarnine 


Methvl cellulose 


High 


1 2 me/ke/d 


1,3,7,14 d 






N-HiethvltiifrfKamttip 


Mefhvlcellulosp 


Low 


1 2 me/ke/d 


1 3 7 14 d 






Nifedipine 


Methylcellulose 


Safe 


3 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Nitrofurantoin 


Saline 


High 


150 me/kg (ip) 


1,3,7,14 d 






Nitrofurantoin 


Saline 


Low 


15mg/kg(ip) 


1,3,7,14 d 




1,3,7,14 d 


Nitrosodiethanolamine 


Methylcellulose 


High 


200 mg/kg/d 


1,3,7,14 d 






Nitrosodiethanolamine 


Methylcellulose 


Low 


20 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Nitrosomethylethylamine 


Methylcellulose 


High 


75 mg/kg/d 


1,3,7,14 d 


1,3 d 




Ni trosomethylethylamine 


Methylcellulose 


Low 


7.5 mg/kg/d 


l,3,7,14d 


3,7,14 d 




N-nitrosodibutylamine 


Methylcellulose 


High 


25 mg/kg/d 


1,3,7,14 d 






N-nitrosodibutylamine 


Methylcellulose 


Low 


2.5 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


N-nitrosomorpholine 


Methylcellulose 


High 


35 mg/kg/d 


1,3,7 d 


1,3,7,14 d 




N-nitrosomorpholine 


Methylcellulose 


Low 


3.5 mg/kg/d 


1,3,7,14 d 






N-Nitrosopiperidine 


Methylcellulose 


High 


200 mg/kg/d 


1,3,7,14 d 


1,3 d 




N-Nitrosopiperidine 


Methylcellulose 


Low 


20 mg/kg/d 


1,3,7,14 d 






NNK 


Methylcellulose 


High 


20 mg/kg/d 


1,3,7,14 d 






NNK 


Methylcellulose 


Low 


2 mg/kg/d 


1,3,7,14 d 






Pentachlorophenol 


Methylcellulose 


High 


50 mg/kg/d 


1,3,7,14 d 
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Pentachlorophenol 


Methylcellulose 


Low 


5 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Phenobarbital 


Methylcellulose 


High 


80 mg/kg/d 


1,3,7,14 d 






Phenobarbital 


Methylcellulose 


Low 


8 mg/kg/d 


1,3,7,14 d 






Piperonyl Butoxide 


Methylcellulose 


High 


1200 mg/kg/d 


1,3,7,14 d 






Piperonyl Butoxide 


Methylcellulose 


Low 


1 l\) mg/kg/d 


1 1 1 1 A A 

1,3,7,14 d 






Potassium bichromate 


Methylcellulose 


High 


I0mg/kg(ip) 


6,12,24,48 h 






Potassium bichromate 


Methylcellulose 


Low 


I mg/kg (ip) 


6,12,24,48 h 






Prazosin 


Methylcellulose 


!>ate 


I mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Pregnenolone- 16a- 
Carbonitrile 


Methylcellulose 


^igh 


100 mg/kg/d 


1,3,7,14 d 






Pregnenolone- 16a- 
Carbonitrile 


Methylcellulose 


Low 


10 mg/kg/d 


1,3,7,14 d 






Propranolol 


Methylcellulose 


Safe 


40 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Pyridine 


Methylcellulose 


High 


300 mg/kg/d 


1,3,7,14 d 


14 d 




Pyridine 


Methylcellulose 


Low 


30 mg/kg/d 


1,3,7,14 d 






Ranitidine 


Methylcellulose 


sale 


5 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Rifampicin 


Methylcellulose 


High 


250 mg/kg/d 


1,3,7,14 d 






Rifampicin 


Methylcellulose 


Low 


25 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Saline (ip) 




Control 




6,12,24,48 h 




6,12,24,48 h 


Saline (ip) 




Control 




1,3,7,14 d 




1,3,7,14 d 


Saline (iv) 




Control 




6,12,24,48 h 




6,12,24,48 h 


oaline (iv) 




uontroi 




1 T 1 A A 
1,2, 0 




1 jZjJ, 1 * 0 


baiine (iv) 




Control 




1 1 7 1 A A 




1 1 1 1 A A 


Saline (po) 




Control 




fC 1 1 1/1 AQ U 




IT 11 1 A AO U 

0,1 z,z4,io n 


Terfenadine 


Methylcellulose 


Safe 


10 mg/kg/d 


1,3,7,14 d 




1,3,7,14 d 


Thioacetamide 


Saline 


High 


200 mg/kg (ip) 


6,12,24,48 h 


12,24,48 h 




Thioacetamide 


Saline 


Low 


20 mg/kg (ip) 


6,12,24,48 h 


6,12,24,48 h 




Thioacetamide 


Methylcellulose 


^igh 


19.2 mg/kg/d 


1,3,7,14 d 






Thioacetamide 


Methylcellulose 


Low 


1.92 mg/kg/d 


1,3,7,14 d 






Thiobenzamide 


In Feed 


High 


ad libitum 1 g/kg 


1,3,7,14 d 


1,3,7,14 d 




Thiobenzamide 


In Feed 


Low 


ad libitum 0.1 g/kg 


1,3,7,14 d 






Untreated 




Control 




1,3,7,14 d 




1,3,7,14 d 


Untreated 




Control 




6,12,24,48 h 




6,12,24,48 h 


Untreated 




Control 




1,2,3,4 d 




1,2,3,4 d 


WY-14643 


Methylcellulose 


High 


60 mg/kg/d 


1,3,7,14 d 
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WY-14643 


Methylcellulose 


Low 


6mg/kg/d 


1,3,7,14 d 






Xylidine 


Com Oil 


High 


600 mg/kg/d 


1,3,7,14 d 






Xylidine 


Corn Oil 


Low 


60 mg/kg/d 


1,3,7,14 d 







Example 2: Induction of Zone 3 Necrosis in vitro 

Over 100 compounds were chosen based on evidence that they elicit one of eleven 
selected histopathology subtypes. The criteria for inclusion of a compound into the zone 3 
5 necrosis mode was necrosis or individual cell necrosis in centrilobular zone with an increase in 
some or all serum transaminases. The compounds included in the zone 3 necrosis mode can been 
seen in Table 2. Rat primary hepatocytes were treated with the same compounds used in the in 
vivo experiments. In all cases, compounds were run in any given two-day period with an 
untreated and DMSO vehicle as negative controls. Each compound was delivered daily at a 

10 reference dose derived from the literature (when available), a lOx high dose, and 1/10 and 1/100 
low doses for up to 72 hours. Three hepatocyte cultures for each dose were randomly assigned 
for harvest at 6, 24, 48 and 72 hours. In order to best identify genes characteristic of zone 3 
necrosis, total RNA for all hepatocyte cultures from a given dose time point were pooled and 
converted to mRNA and cDNA for GeneCalling® In GeneCalling, the cDNA is cut with a 

1 5 battery of restriction enzyme pairs in different combinations followed by amplification by PCR 
using specific primers linked to specific adaptors. After gel electrophoresis, the resulting 
fragments are identified based on the inherent information in the cDNA fragment: The flanking 
restriction site sequences on the ends, the size of the fragment and the species (and sometimes 
the tissue) origin of the DNA. This information is used to query public and proprietary 

20 databases. The fragments that do not match any sequences in the database are isolated, 
sequenced and identified as novel. 

Table 2: Zone 3 Necrosis in vitro 



Compound 


Annotation 


Concentration 
(microM) 


Zone 3 Necrosis 
Pathology 


1 ,3-Dibromobenzene 




63 




1 ,3-Dibromobenzene 




250 


Present 


2AAF 




10 




2AAF 




50 




2-nitrofluorene 




30 





30 



2-nitrofluorene 




150 




Acetamide 




500 




Acetamide 




2000 




Acetaminophen 




100 




Acetaminophen 




1000 


Present 


Aflatoxin 




0.008 




Aflatoxin 




0.04 




Allylalcohol 




16 




Allylalcohol 




80 




Allylformate 




0.06 




Allylformate 




0.3 




Amiodarone 




5 




Amiodarone 




20 




ANIT 




3.1 




AN IT 




13 




Aspirin 


non-toxic 


55.5 


Absent 


Aspirin 


non-toxic 


555 




Atorvastatin 


non-toxic 


0.0095 


Absent 


Atorvastatin 


non-toxic 


0.095 




Azaserine 




100 




Azaserine 




500 


Present 


Azobenzene 




1.6 


Absent 


Azobenzene 




8 




BCNU 




50 




BCNU 




250 




Butylhydroxytoluene 




75 




Butylhydroxytoluene 




T50 




Bretylium 


non-toxic 


1.2 


Absent 


Bretylium 


non-toxic 


12 




Bromobenzene 




600 




Bromobenzene 




3000 


Present 


Carbamate 




300 




Carbamate 




1500 




Carbamazepine 




200 




Carbamazepine 




1000 




CCNU 




8 




CCNU 




40 




CdCI 




0.1 




CdCI 




0.5 




CeCI3 




4 




CeCI3 




20 


Present 


Cefuroxime 


non-toxic 


224 


Absent 


Cefuroxime 


non-toxic 


2240 




Chlordane 




8 


Absent 


Chlordane 




40 





31 



CIDirect 




1 




CIDirect 




5 




Ciprofibrate 




100 




Ciprofibrate 




500 




Clofibrate 




100 


Absent 


Clofibrate 




400 




Clonidine 


non-toxic 


0.0165 


Absent 


Clonidine 


non-toxic 


0.165 




Colchicine 




500 




Colchicine 




2000 




Concanavalin A 




5000 




Concanavalin A 




20000 




Coumarin 




63 




Coumarin 




250 


Present 


4-chlorobenzotrifluoride 




250 




4-chlorobenzotrifluoride 




1000 




Cyclosporine 




3.1 


Absent 


Cyclosporine 




13 




Cyproterone 




10 




Cyproterone 




50 




2,4-diaminotoluene 




0.8 




2,4-diaminotoluene 




4 




Di(2-Ethylhexyl)phthalate 




500 




Di(2-Ethylhexyl)phthalate 




2000 




Dehydroepiandosterone 




1.2 




Dehydroepiandosterone 




6 




Deoxycholate 




6.3 




Deoxycholate 




25 




Dexamethasone 




100 


Absent 


Dexamethasone 




500 




Dibutyltin 




0.2 




Dibutyltin 




1 




Dichlorobenzene 




100 


Absent 


Dichlorobenzene 




500 




Diethylnitrosamine 




60 




Diethylnitrosamine 




300 


Present 


Diethylstillbestrol 




5 




Diethylstillbestrol 




50 




Dimethylnitrosamine 




200 




Dimethylnitrosamine 




1000 


Present 


Disopyramide 


non-toxic 


3 


Absent 


Disopyramide 


non-toxic 


30 




Dimethylformamide 




1000 




Dimethylformamide 




5000 


Present 


DMSO 


control 


0 


Absent 



32 



Doxorubicin 


non-toxic 


0.5 


Absent 


Doxorubicin 


non-toxic 


5 




Endotoxin 




30 




Endotoxin 




100 




Erythromycin 




20 


Absent 


Erythromycin 




100 




Ethanol 




200 


Absent 


Ethanol 




1000 




Ethinylestradiol 




25 




Ethinylestradiol 




100 




Ethionine 




200 




Ethionine 




1000 


Present 


Ethosuximide 


non-toxic 


1000 


Absent 


Ethosuximide 


non-toxic 


10000 




Ethylenethiourea 




200 




Ethylenethiourea 




1000 




Fenarimol 




20 


Absent 


Fenarimol 




100 




Fenbendazole 




16 




Fenbendazole 




63 




Fluconazole 


non-toxic 


0.816 


Absent 


Fluconazole 


non-toxic 


8.16 




Gabapentin 


non-toxic 


2 


Absent 


Gabapentin 


non-toxic 


20 




Galactosamine 




12 




Gaiactosamine 




60 




Hydrazine 




20 




Hydrazine 




100 




Ibuprofen 


non-toxic 


50 


Absent 


Ketoconazole 




2 


Absent 


Ketoconazole 




10 




Mephenytoin 


non-toxic 


14.2 


Absent 


Mephenytoin 


non-toxic 


142 




Methapyriline 




30 




Methapyriline 




100 




Methylcholanthrene 




40 


Absent 


Methylcholanthrene 




200 




Methylenedianiline 




1.4 




Methylenedianiline 




7.8 




Methylleugenol 




100 




Methylleugenol 




500 




Microcystin 




0.005 




Microcystin 




0.025 




Minoxidil 


non-toxic 


0.166 


Absent 


Minoxidil 


non-toxic 


1.66 





33 



Mirex 




50 


Absent 


Mirex 




100 




Molybdenum 




20 




Molybdenum 




50 


Present 


Monocrotaline 




30 




Monocrotaline 




100 


Present 


Methyl-tert-buty[ ether 




1000 




Methyl-tert-butyl ether 




4000 




Nifedipine 


non-toxic 


0.335 


Absent 


Nifedipine 


non-toxic 


3.35 




Nitrofurantoin 




4 


Absent 


Nitrofurantoin 




20 




Nitrosodibutylamine 




200 


Absent 


Nitrosodibutylamine 




1000 




Nitrosodiethanolamine 




1000 


Absent 


Nitrosodiethanolamine 




5000 




Nitrosomethylethylamine 




200 




Nitrosomethylethylamine 




1000 


Present 


Nitrosomorpholine 




750 




Nitrosomorpholine 




3750 


Present 


Nitrosopiperidine 




640 




Nitrosopiperidine 




3200 


Present 


NNK 




200 




NNK 




1000 




Norethindrone 




40 


Absent 


Norethindrone 




200 




Pentachlorophenol 




19 


Absent 


Pentachlorophenol 




38 




Piperonyl 




20 




Piperony) 




100 




Prazosin 


non-toxic 


0.0148 


Absent 


Prazosin 


non-toxic 


0.148 




Pregnenolone 




38 


Absent 


Pregnenolone 




150 




Propranolol 


non-toxic 


0.125 


Absent 


Propranolol 


non-toxic 


1.25 




Pyridine 




800 




Pyridine 




4000 


Present 


Ranitidine 


non-toxic 


0.128 


Absent 


Ranitidine 


non-toxic 


1.28 




Rifampicin 




20 


Absent 


Rifampicin 




100 




Terfenadine 


non-toxic 


0.15 


Absent 


Terfenadine 


non-toxic 


1.5 




Thioacetamide 




500 





34 



Thioacetamide 




2000 


Present 


Thiobenzamide 




8 




Thiobenzamide 




40 


Present 


Untreated 


control 


o 1 


Absent 


Verapamil 


non-toxic 


0.1 


Absent 


Verapamil 


non-toxic 


1 




WY14643 




20 




WY14643 




100 




Xylidine 




13 




Xylidine 




50 





Example 3: Methods of Analysis 



Data Preparation: 

We used GeneCalling® to estimate the activity of several thousand transcripts 
simultaneously. These data generally have -5% missing data and are log normally distributed. 
5 The data are log transformed and missing values are filled using k-nearest neighbor (knn) 
replacement [14]. The knn algorithm was initially validated using a complete data set and 
randomly eliminating constant percentages of the data. It was determined that using correlation 
as a similarity index and imputing missing values with 6 nearest neighbors resulted in the 
smallest error of prediction. 

10 Initially our data sets contained between 6000 and 8000 genes, which poses two 

problems. These large numbers of genes make most marker selection procedures 
computationally intractable with most computer algorithms. Second, the inclusion of markers 
with low variation, or low association with pathology results in a significant risk of choosing 
markers that over fit the models. To eliminate these problems we imposed an initial filter on the 

1 5 data, requiring that there be a significant difference between negative control samples and 

positive control samples for each pathology mode (Kruskal-Wallis test, p<0.001). Depending on 
mode, this process reduced our gene set to a more tractable number of genes (approximately 200- 
800 genes depending on mode). 

Initial Marker Selection: 

20 We define a marker as a gene that helps to explain some variation in pathology. In order 

to avoid selecting markers that particularly fit our current data set well at the expense of 
predictability outside our training set, we employed a leave one out cross-validation method to 
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identify markers that contribute some explanatory power to the data set. Specifically, after the 
Kruskal-Wallis filter, a series of leave one out models are created leaving out all of the samples 
for each compound until all compounds have been left out once. This process results in a marker 
list and a count of the number of leave-one out models the marker was used in. This marker list 
5 contains all of the genes that explain some portion of the variation in pathology but is almost 
certain to over fit the data because of its size. In order to refine this marker list a second series of 
leave one compound out models is created for each different count of markers within the marker 
list. For example, the initial leave one out model may produce a gene list consisting of 5 genes 
that occur at frequencies of 20, 19, 19, 2, and 1 leave one out models respectively. The first step 

10 will use all genes that occur in 1 or more leave one out models, the second step 2 or more, the 
third 19 or more and finally 20 or more. The genes that are considered to be markers will have 
been used in a majority of models and result in a highly sensitive model. In most cases the first 
modeling step resulted in a sharp cutoff (e.g. 19 or more in the above example), which guides the 
marker selection process. In a few cases, the change in frequency was so gradual that no clear 

15 cutoff was available. Marker selection then proceeded with the most sensitive model first, and 
then the most specific model and in the case of ties the least number of markers. 

Models: 

The above process is a general strategy that is applied to all of our marker selection 
models. The models we used covered a range of statistical power and assumption stringency. 

20 The most powerful model with the strongest assumptions is a linear discriminant analysis, 
followed by logistic regression and finally by classification trees, which is virtually devoid of 
assumptions but does have a cost in terms of predictivity. These modeling methods are common 
statistical procedures that need not be developed here [15] [16] for a more detailed discussion). 
All three methods went through the algorithm outlined above with the exception of discriminant 

25 analysis, which did not utilize a Kruskal-Wallis filter. Both discriminant analysis and logistic 
regression create poor models when too many intercorrelated variables are used. To minimize 
this problem, these methods utilized a stepwise selection procedure (incorporating both forward 
and reverse selection) to select the best discriminating set of markers. 
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In vivo Pathology Annotation: 

In order to construct in vivo models, the pathology of each sample had to be determined. 
Pathology was assigned to each liver sample by the pathologists at Bayer's Stillwell, Kansas 
facility. Criteria for inclusion of a compound into a particular mode included the following: 

i. Zone- 3 Necrosis: Necrosis or individual cell necrosis in centrilobular zone with an 
increase in some or all serum transaminases. 

il Cholestasis: Increased plasma billirubin with bile duct necrosis or hyperplasia. 

Hi. Hypertrophy: Increase in cell size and liver weight. 

iv. Genotoxic Carcinogens: Some evidence of mild parenchymal damage in vivo 
which may be associated with an increase in mitosis. Compounds assigned to this 
group must have strong historical documentation. Compounds from other 
pathology modes cannot be added to this list. 

v. Non-genotoxic Carcinogens: Evidence of parenchymal changes and an increase in 
mitosis. Compounds assigned to this group must have strong historical 
documentation. Compounds from other pathology modes cannot be added to this 
list. 

vL Steatosis: Increase in lipid accumulation or "vacuolar degeneration." 

vii. Zone 1 Necrosis: Necrosis or individual cell necrosis in the periportal zone, with 
an increase in some or all serum transaminases, 

vii. Inflammation: Increase in inflammatory cells (e.g. Kupffer cells, neutrophils, 
macrophages, lymphocytes) 

vm. Apoptosis: Shrinking or fragmentation of the nucleus and increased "blebbing." 

In vitro Pathology Annotation: 

In order to construct in vitro models we need to make a decision about what pathology 
each sample represents. Two approaches were used. The first is to ascribe the in vivo compound 
annotation to the two highest concentrations in vitro (these are traditionally within five fold of 
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each other). For example, clofibrate produced hypertrophy in vivo, so the two highest doses in 
vitro are used as a positive control for the model construction process. The second annotation 
strategy uses a nearest neighbor algorithm to assign annotation from in vivo samples to in vitro 
samples. Briefly, each in vitro sample was correlated, across in vivo markers, to all in vivo 
5 samples of the same compound. The annotation of the most correlated in vivo sample was used 
as the in vitro annotation. 



Final Marker Selection: 

This process resulted in 6 sets of models being generated for each mode of pathology 
(LDA, logistic, and classification trees for each annotation strategy, nearest neighbor and high 
10 dose). These markers are then correlated with the original data set (between 6000 and 8000 

genes) and additional correlated markers (r>=0.60 across 329 samples, up to 3 per marker) were 
added back in to the final gene set for representation on a microarray. 



Example 4: Identification of Zone 3 Necrosis Related Genes in vivo 

15 

SP = Secreted Protein 

NC = Novel Rat Composition 

NU = Novel Rat Utility 

20 Table 3 





ACCN 
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MARKER 
ASSIGNMEN 
T 


SEQID 
NO: 


Definition 


Description 


Bin 


NU 


scr gb- 
af03887 
0 4 


1 


1 


Rattus norvegicus 
betaine homocysteine 
methyl transferase 
(BHMT) [AF038870]. 


Betaine-homocysteine methyltransferase 
(BHMT) catalyzes the transfer of an N-methyl 
group from betaine to homocysteine to produce 
dimethylglycine and methionine, respectively. 
The enzyme is found in the pathway of choline 
oxidation and is abundantly expressed in liver 
and kidney. It has been known for at least 50 
years that alterations in methionine metabolism 
occur in human liver cirrhosis. Recently human 
BHMT had been shown to be a zinc 
metalloenzyme [14] [15]. 


Amino Acid 
Metabolism 
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1241/1396) to Mus a 
iuscuIus betaine- y 
omocysteine c 
nethyltransferase 2 E 
Bhmt2) [AF257474]. n 


etaine-homocysteine methyitransferase 
JHMT) catalyzes the transfer of an N-methyl 
roup from betaine to homocysteine to produce 
imethylglycine and methionine, respectively, 
he enzyme is found in the pathway of choline 
xidation and is abundantly expressed in liver 
nd kidney. It has been known for at least 50 
ears that alterations in methionine metabolism 
ccur in human liver cirrhosis. Recently human 
IHMT had been shown to be a zinc P 
netalloenzyme [14] [15]. * 


tfnino Acid 
Metabolism 


scr gb- 
x95189_ 
NU 4 
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F 

c 
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c 

i 

Rattus norvegicus s 
rrihydroxycoprostano 1 
/1-CoA Oxidase 
k X95189]. 


*at liver peroxisomes contain three acyl-CoA 
>xidases:palmitoyl-CoA oxidase, pristanoyl-CoA 
>xidase, and trihydroxycoprostanoyl-CoA 
>xidase. Mammalian liver peroxisomes are 
capable of beta-oxidizing a variety of substrates 
ncluding very long chain fatty acids and the 
side chains of the bile acid intermediates di- and 
rihydroxycoprostanic acid. The first enzyme of 
peroxisomal beta-oxidation is acyl-CoA oxidase 
[16]. 


Jpid Metabolism 


scr gb- 
m59814 
NU J 


4 


4 


Rattus norvegicus 
Ephrin type-B 
receptor 1 precursor 
(EphB1)[P09759] 


Eph receptor tyrosine kinases and their 
membrane-bound ligands, ephrins, have thus 
emerged as mediators of cell-contact- 
dependent repulsion. The actin cytoskeleton is 
also a major target of the intracellular pathways 
activated by Eph receptors [17]. More 
specifically, activation of EphB1 by its ligand, 
ephrin-B1/Fc has been shown to recruit Nek to 
native receptor complexes and activate c-Jun 
kinase (JNK/SAPK) [18] 


Cell Cycle 
Regulation 
(Regulation Of 
Proliferation) 


scr gb~ 
m29358 
NU _5 


5 


5 


Rattus norvegicus 
ribosomal protein S6 
[M29358]. 


It has been known for 20 years that the 
ribosomal protein S6 is rapidly phosphorylated 
when cells are stimulated to grow or divide [19]. 
S6 is phosphorylated in response to mitogens 
by activation of one or more protein kinase 
cascades. Members of the 90 kDa S6 kinases 
are activated in vitro by 42 kDa and 44 kDa 
MAP kinases, which are in turn activated by 
mitogen-dependent activators [20]. 


Protein 
Metabolism 


aj29773 
NU 6 


6 


6 


Rattus norvegicus 
heat shock protein 86 
(hsp86)[AJ428213]. 


In addition to appearing in response to 
biological stresses, heat shock proteins are 
expressed as 'chaperones* by some cells living 
in physiological conditions. Among these 
proteins, the Hsp90 family, consisting of 
isoforms Hsp84 and Hsp86, seems to function 
under normal growth conditions in the pathways 
of numerous signal transducers, cell cycle and 
developmental regulators. [21,22] 


Protein 
Metabolism 


NU |00719 


7 


7 


Rattus norvegicus 
cytochrome p-450 
isoform, 
(phenobarbital- 
inducible or 2B1) 
[J00719] [P04167]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They play a central role in 
the metabolism of steroids, the detoxification ol 
drugs and xenobiotics, and the activation of 

„„„ rA innnnnc In liv/or miprn^omfiS this 

procarcinogens. in iivei iiuuwumco, ui» 
enzyme is involved in an NADPH-dependent 
electron transport pathway. It oxidizes a variety 
of structurally unrelated compounds, including 
steroids, fatty acids, and xenobiotics [23]. 


Detoxification 
Response / 
Biotransformation 
-TOX 
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NU 


100720 


8 


8 


Rattus norvegicus 
cytochrome p-450 
isoform, 
(phenobarbital- 
inducible or 2B2) 
[P04167] [J00719]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They piay a central role in 
the metabolism of steroids, the detoxification of 
drugs and xenobiotics, and the activation of 
procarcinogens. In liver microsomes, this 
enzyme is involved in an NADPH-dependent 
electron transport pathway. It oxidizes a variety 
of structurally unrelated compounds, including 
steroids, fatty acids, and xenobiotics [23]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


j00728 


9 


9 


Rattus norvegicus 
cytochrome p-450 
isoform, 
(phenobarbital- 
inducible, 2B1, or 
2B2) [P00176] 
[P04167][Q64584]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They play a central rote in 
the metabolism of steroids, the detoxification of 
drugs and xenobiotics, and the activation of 
procarcinogens. In liver microsomes, this 
enzyme is involved in an NADPH-dependent 
electron transport pathway. It oxidizes a variety 
of structurally unrelated compounds, including 
steroids, fatty acids, and xenobiotics [23]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


100320 


10 


10 


Rattus norvegicus 
cytochrome p-450 
isoform 

(phenobarbital- 
inducible, 2B1, or 
2B2) [J00719] 
[P00176] [P04167]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They play a central role in 
the metabolism of steroids, the detoxification of 
drugs and xenobiotics, and the activation of 
procarcinogens. In liver microsomes, this 
enzyme is involved in an NADPH-dependent 
electron transport pathway. It oxidizes a variety 
of structurally unrelated compounds, including 
steroids, fatty acids, and xenobiotics [23]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


m11251 


11 


11 


Rattus norvegicus 
cytochrome p-450 
isoform 

(phenobarbital- 
inducible, 2B1, or 
2B2) [Q64584] 
P00176][P04167]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They play a central role in 
the metabolism of steroids, the detoxification of 
drugs and xenobiotics, and the activation of 
procarcinogens. In liver microsomes, this 
enzyme is involved in an NADPH-dependent 
electron transport pathway. It oxidizes a variety 
of structurally unrelated compounds, including 
steroids, fatty acids, and xenobiotics [23]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


m26125 


12 


12 


Rattus norvegicus 
epoxide hydrolase 
[M26125]. 


Epoxide formation from drugs, chemicals, food 
additives and environmental pollutants is 
catalyzed by cytochrome P-450 dependent 
monooxygenase(s). Epoxides are converted to 
glycols or dihydrodiols by epoxide hydrolase. 
These enzymes are known to be present in the 
microsomes of different mammalian tissues and 
in the hepatic nuclei from rats and humans. The 
balance between the epoxide forming (AHH) 
and metabolizing (EH) enzyme activities may 
provide information about the "epoxide 
exposure" of a tissue [Kuklin, 1976 #2]. 


Detoxification 
Response / 
Biotransformation 
-TOX 
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NU 


m34452 


13 


13 


Rattus norvegicus 
cytochrome P450e-L 
(P450HB2) [M34452]. 


Epoxide formation from drugs, chemicals, food 
additives and environmental pollutants is 
catalyzed by cytochrome P-450 dependent 
monooxygenase(s). Epoxides are converted to 
glycols or dihydrodiols by epoxide hydrolase. 
These enzymes are known to be present in the 
microsomes of different mammalian tissues and 
in the hepatic nuclei from rats and humans. The 
balance between the epoxide forming (AHH) 
and metabolizing (EH) enzyme activities may 
provide information about the "epoxide 
exposure" of a tissue [Kuklin, 1976 #2]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


U33546 


14 


14 


Rattus norvegicus 

CYP2B16P 

[CAB35441]. 


CYP2B16P is an apparent pseudogene in the 
rat cytochrome P450 2B (CYP2B) subfamily 
[24]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


X74673 


15 


15 


Rattus norvegicus 
aflatoxin B1 aldehyde 
reductase (AFAR) 
[X74673]. 


Aflatoxin B1 aldehyde reductase/Succinic 
semialdehyde reductase is believed to be 
involved in the detoxification of xenobiotic 
carbonyl compounds [25]. 


Detoxification 
Response / 
Biotransformation 
-TOX 
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X13044 
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16 


16 


Rattus norvegicus 
MHC-associated 
invariant chain 
gamma (la antigen- 
associated invariant 
chain) (li) [X13044]. 


MHC class il molecules at the surface of 
antigen presenting cells present antigenic 
peptides to CD4+ T helper cells, li plays a 
critical role in MHC class II antigen processing 
by stabilizing peptide-free class II alpha/beta 
heterodimers [26]. 


Immunity And 
Defense 


SP 


scr gb- 
X14254 
5 


17 


17 


Rattus norvegicus 
MHC-associated 
invariant chain 
gamma (la antigen- 
associated invariant 
chain) (li) [X14254]. 


MHC class II molecules at the surface of 
antigen presenting cells present antigenic 
peptides to CD4+ T helper cells, li plays a 
critical role in MHC class II antigen processing 
by stabilizing peptide-free class II alpha/beta 
heterodimers [26]. 


Immunity And 
Defense 


NU 


scr gb- 
bi27563 


18 


18 


Rattus norvegicus 
Ras-related protein 
Rab-2 [P05712]. 


Rab proteins form the largest branch of the Ras 
superfamily of GTPases. They are localized to 
the cytoplasmic face of organelles and vesicles 
involved in the biosynthetic/secretory and 
endocytic pathways in eukaryotic cells [27]. 


Intracellular 
Transport 




scr gb- 
X66871 
3 


19 


19 


Rattus norvegicus 
calpactin I heavy 
chain (annexin II) 
[X66871]. 


The annexins are a family of proteins that bind 
acidic phospholipids in the presence of Ca2+. 
Because annexin II bridge secretory granules to 
plasma membrane it has suggested that this 
protein may play a role in Ca(2+)-dependent 
exocytosis. Annexin II tetramer has also been 
demonstrated on the extracellular face of some 
metastatic cells where it mediates the binding of 
certain metastatic cells to normal cells. Annexin 
II tetramer is a major cellular substrate of 
protein kinase C and pp60src [28]. 


Intracellular 
Transport 


NU 


scr gb- 
I49379 
3 


20 


20 


Rattus norvegicus 
canalicular 
multispecific organic 
anion transporter 
(cMOAT) [L49379]. 


cMOAT mediates the hepatobiliary excretion of 
numerous organic anions. It has been shown 
that both multidrug resistance-associated 
protein (MRP1) and canalicular multispecific 
organic anion transporter (CMOAT/MRP2) have 
the ability to extrude glutathione conjugates 
(GS-X pump activity) from cells [29] [30]. 


Oxidative Stress- 
TOX 
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21 


21 


Rat gene fragment - 
775 bp. 85% SI 
(618/722) to Homo 
sapiens inter-alpha- 
trypsin inhibitor heavy 
chain ITIH1 [X63652]. 


Inter-alpha-trypsin inhibitor (IT!) is a complex 
protein containing two heavy polypeptide chains 
(H1 and H2) and a light chain, which in the free 
state is known as bikunin [31]. ITI is a 220 kDa 
serine proteinase inhibitor found in human 
serum [32]. 


Other 


NC 


scr gb- 
aw1417 
35_3 


22 


22 


Rat gene fragment - 
1561 bp. 98% SI 
(1002/1022) to Mus 
musculus serine 
proteinase inhibitor 
mBM2A [U96701J. 


Serine proteinase inhibitors (serpins) are 
classically regulators of extracellular proteolysis. 
Evidence suggests that some function 
intracellulariy as well [33]. 


Other 


SP 


af 18498 
3 


23 


23 


Rattus norvegicus 
osteoactivin 
[AF1 84983] . 


Osteoactivin cDNA was recently isolated from 
long bone and calvaria . In primary rat 
osteoblast cultures it was expressed at the 
highest levels during the later stages of matrix 
maturation and mineralization and correlated 
with the expression of alkaline phosphatase and 
osteocalcin. [34]. 
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Unknown, 241 bp. 
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Unknown, 283 bp. 




Novel 
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26 
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Unknown, 642 bp. 
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NU 
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27 


27 


Unknown, 866 bp. 
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NU 
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28 


28 


Unknown, 629 bp. 
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bm3833 
27 J 


29 


29 


Unknown, 1145 bp. 
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NU 
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30 
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Unknown, 3087 bp. 
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31 


31 


Unknown, 434 bp. 
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NU 
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1701427 

36J 


32 
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Unknown, 221 bp. 
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Unknown, 581 bp. 
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Unknown, 221 bp. 
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SP = Secreted Protein 

NC = Novel Rat Composition 

NU = Novel Rat Utility 
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SEQ ID 
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Definition 
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NU 


cszr 96561 
134 837604 
93 


35 


35 


Rattus norvegicus 

Carbamoyl- 

phosphate 

synthase 

[ammonia] 

(CPSASE I), 

mitochondrial 

precursor 

[P07756]. 


Mitochondrial protein involved in the urea acid cycle 
of ureotelic animals where the enzyme plays an 
important role in removing excess ammonia from 
the cell. Catalytic Activity: 2 ATP + NH(3) + CO(2) 
+ H(2)0 = 2 ADP + ORTHOPHOSPHATE + 
CARBAMOYL PHOSPHATE [17]. 


Amino Acid 
Metabolism 


NU 


scr_gb- 
X83855J 


36 


36 


Rattus norvegicus 
hepatocyte 
EP3alpha 
receptor [X83855]. 


FP^ rpnpntnrc fnr PrnQtanlanrlin fPf^ Ff2^ arp 
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primarily involved in inhibition of adenylyl cyclase 
via G(i) activation, and in Ca(2+)-mobilization 
through Gbetagamma from G(i). Along with G(i) 
activation, the EP3 receptor can stimulate cAMP 
production via G(s) activation [18]. 


Carbohydrate 
Metabolism 


NU 


cszr 229800 
465 190907 
286 


37 


37 


Rattus norvegicus 

non-neuronal 

enolase (NNE) 

(alpha-alpha 

enolase, 2- 

phospho-D~ 

glycerate 

hydrolase 

[X02610]. 


Enolase is a vital enzyme of the glycolytic pathway. 
It exists mainly in two forms, non-neuronal enolase 
(NNE) and neuron specific enolase (NSE). 
Catalytic Activity: 2-phospho-D-glycerate = 
phosphoenolpyruvate + H(2)0 [19]. 


Carbohydrate 
Metabolism 


SP 


scr gb- 
bi277612_1 


38 


38 


Rat gene 
fragment -1381 
bp. 89% SI 
(816/910) to Mus 
musculus for 
beta- 

hexosaminidase 
[Y00964]. 


Two genes, HEXA and HEXB, encode the alpha- 
and beta-subunits, respectively, of human beta- 
hexosaminidase. In the mouse, the corresponding 
genes are termed Hexa and Hexb. The subunits 
have the capacity to degrade a variety of substrates 
including oligosaccharides, glycosaminoglycans, 
and giycolipids containing Deta-iinked N- 
acetylglucosaminyl or N-galactosaminyl residues 
[20]. 


Carbohydrate 
Metabolism 


NU 


scr gb- 
j05266_3 


39 


39 


Rattus norvegicus 
mitochondrial H+- 
ATP synthase 
alpha subunit 
[J05266]. 


H+-ATP synthase catalyzes the synthesis and/or 
hydrolysis of ATP [21]. 


Energy 
Metabolism 


NU 


scr gb- 
m37394_5 


40 


40 


Rattus norvegicus 
epidermal growth 
factor receptor 
(Egfr) [M37394]. 


Egfr is involved in the initiation of oncogenic effect 
such as DNA synthesis, enhanced cell growth, 
invasion, and metastasis. Specific abrogation of 
EGFR results in cell cycle arrest, apoptosis, or 
dedifferentiate of cancer cells [22]. 


Cell Cycle 
Regulation 
(Regulation Of 
Proliferation) 
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NU 


scr gb- 
m64300__4 


41 


41 


Rattus norvegicus 
extracellular 
signal-related 
kinase (ERK2) 
[M64300J. 


The Raf/MEK/ERK signaling was the first MAP 

If inaco r_acran*p tf> hp pharpctpri7Pri It i*Q nrfihahk/ 
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one of the most well known signal transduction 
pathways among biologists because of its 
implication in a wide variety of cellular functions as 
diverse -and occasionally contradictory- as cell 
proliferation, cell-cycle arrest, terminal differentiation 
and apoptosis [23]. 


Cell Cycle 
Regulation 
(Regulation Of 
Proliferation) 


SP 


scr gb- 
M294409J 


42 


42 


Rat gene 

fragment - 526 bp. 

OO/O Ol \d\Qfd\J\J) 

to Mus musculus 
type I interferon 
receptor soluble 
isoform precursor 
(IFNAR2) 
[AF013486]. 


IPKfAR-9 id avnrAQQpH uhim litm iqIv and pvicfc a*; 
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both transmembrane and soluble forms. Recent 
evidence suggests murine IFNAR-2 as an efficient 
regulator of IFN responses. Type 1 interferons are 
cytokines that are important in defense against viral 
infections well as in the control of ceil proliferation 
[24] [25]. 


Cell Cycle 
Regulation 
(Regulation Of 
Proliferation) 


NU 


scr gb- 
ab015747_3 


43 


43 


Rattus norvegicus 

inetrleukin-4 

receptor 

(membrane-bound 
form) [AB015747]. 


IL-4 is a pleiotropic cytokine which plays a pivotal 

rnlp in chaninn immimp rPQnnn^PC Thp pffpftc of 
IU1C 111 ollapiliy IlllillUliC 1 Co|JUI loco. 1 lie CIICoLo \J[ 

IL-4 are mediated after binding to high affinity 
receptor complexes present on hematopoietic as 
well as non-hematopoietic cells. There is also 
evidence that IL-4 interaction with its receptor leads 
to signal transduction mechanisms that result in 
cellular proliferation and / or gene activation [26]. 


Cell Cycle 
Regulation 
(Regulation Of 
Proliferation) 


NU 


scr sc- 

191879433 

1 


44 


44 


Rattus norvegicus 
Crk-associated 
substrate, p130 
[D29766]. 


The Crk-associated substrate (Cas) is a unique 
docking protein with a Src homology 3 (SH3) 
domain. Aberrant CAS tyrosine phosphorylation 
may contribute to cell transformation by certain 
oncoproteins, including v-Crk and v-Src, and to 
tumor growth and metastasis [27] [28]. 


Cell Cycle 
Regulation 
(Regulation Of 
Proliferation) 


NC 


scr sc- 

140438096 

1 


45 


45 


Kai gene 

fragment - 383 bp. 
98% SI (125/127) 
to Homo sapiens 
Diacylglycerol 
kinase, delta 
[Q16760]. 
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in the signal transduction through modulating the 
balance between two signaling lipids, diacylglycerol 
and phosphatidic acid. Diacylglycerol is a protein 
kinase c activator. Thus, DGK is considered to 
regulate protein kinase C activity through the 
reduction of diacylglycerol [29] [30]. 


Cell Cycle / 
Proliferation 
(Basic 
Machinery) 


NU 


scr gb- 
x87~157_5 


46 


46 


Rattus norvegicus 
neurotensin 
endopeptidase 
[X87157]. 


Neurotensin is a 13-amino acid hormonal peptide 
which was first isolated from bovine hypothalamus. 
It is present in the digestive tract as well as in the 
central nervous system. It has a variety of biological 
activities as a central neurotransmitter or 
neuromodulator, and a peripheral hormone [20]. 


Cellular 

Communication 


NU 


scr gb- 
u66707_2 


47 


47 


Rattus norvegicus 

densin-180 

[U66707]. 


Densin-180 is a transmembrane protein that is 
strongly associated with the postsynaptic density in 
CNS neurons and is believed to function as a 
synaptic adhesion molecule [31]. 


Cellular 

Communication 



44 



NU 


scr gb- 
af017393_2 


48 


48 

* 


Rattus norvegicus 

cytochrome 

P4502F4 

(CYP4502F4) 

[AF017393]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They play a central role in the 
metabolism of steroids, the detoxification of drugs 
and xenobiotics, and the activation of 
procarcinogens. In liver microsomes, this enzyme 
is involved in an NADPH-dependent electron 
transport pathway. It oxidizes a variety of 
structurally unrelated compounds, Including 
steroids, fatty acids, and xenobiotics [32]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NU 


scr sc- 

134241980 

1 


49 


49 


Rattus norvegicus 
cytochrome P450 
2B3 (CYP2B3) 
[U16214]. 


The cytochromes P-450 are among the major 
constituent proteins of the liver mixed function 
monooxygenases. They play a central role in the 
metabolism of steroids, the detoxification of drugs 
and xenobiotics, and the activation of 
procarcinogens. In liver microsomes, this enzyme 
is involved in an NADPH-dependent electron 
transport pathway. It oxidizes a variety of 
structurally unrelated compounds, including 
steroids, fatty acids, and xenobiotics [32]. 


Detoxification 
Response / 
Biotransformation 
-TOX 


NC 


scr_sc- 

191609675 

1 


50 


£0 


Rat gene 

fragment -21 7 bp. 
94% SI (67/71) to 
Streptococcus 
pneumoniae DNA 
polymerase III, 
alpha subunit R6 
[AAK99055]. 


DNA polymerase III is a replicative enzyme known 
to be essential in the DNA synthesis of Gram- 
positive bacteria [33]. 


DNA Metabolism 


NU 


scr gb- 
x17037_2 


51 


51 


Rat OX40 antigen 
[X17037]. 


OX40 is a member of the tumor necrosis factor 
family which is expressed by activated T 
lymphocytes [34]. 


Immunity And 
Defense 


SP 


scr_gb- 
bl291805J 


52 


52 


Rat gene 

fragment - 528 bp. 
76% SI (290/380) 
to Homo sapiens 
IgG Fc binding 
protein [D84239]. 


Fc gamma BP is widely expressed on mucosal 
surfaces and in external secretions lending support 
to the concept that Fc gamma BP is an important 
component of mucosal immunological defenses 
[35]. 


Immunity And 
Defense 


NU 


scr gb- 
aj000696_5 


53 


53 


Rattus norvegicus 
novel kinesin- 
related protein, 
KIF1D 
[AJ000696]. 


The proteins of the kinesin superfamily (KIFs) are 
microtubule-based molecular motors whose 
functions include the transport of membrane-bound 
organelles. The KIF1 subfamily members are 
monomelic and contain a number of amino acid 
inserts in surface loops [36]. 


Intracellular 
Transport 


NU 


scr gb- 
d79221_3 


54 


54 


Rattus norvegicus 
r-sly1 [U35364]. 


r-sly1 is a mammalian homologue to yeast Slylp 
which plays a critical role in endoplasmic reticulum 
to Golgi apparatus vesicle trafficking [37]. 


Intracellular 
Transport 



45 



NU 


m61937 


55 


55 


Rattus norvegicus 
dihydrodiol 
dehydrogenase 
[M61937]. 


Dihydrodiol dehydrogenase(s) (DD) have been 
implicated in the detoxication of proximate (trans- 
dihydrodioi) and ultimate carcinogenic (anti-diol- 
epoxide) metabolites of polycyclic aromatic 
hydrocarbons (PAHs). Although this pathway 
suppresses the formation of the PAH anti- and syn- 
diol epoxides (ultimate carcinogens), the process of 
autoxidation is anticipated to yield reactive oxygen 
species (ROS) [38]. 


Oxidative Stress- 
TOX 


NU 


cszr 229602 
935 183895 
355 


56 


56 


Rattus norvegicus 

metallothionein-i 

(mt-1)[J00750]. 


Metallothionein (MT) is a small, cysteine-rich, metal- 
binding protein. MT synthesis is induced by various 
stimuli such as heavy metals, oxidative stress, 
anticancer drugs and fasting stress. MT is capable 
of not only reducing metal toxicity but also 
scavenging free radicals [39]. 


Oxidative Stress- 
TOX 


NU 


scr gb- 
af 106944 J. 


57 


57 


Rattus norvegicus 
Peroxiredoxin III 
[AF106944]. 


Peroxiredoxins are novel family of anti-oxidative 
proteins comprise six members in mammals. They 
share a common reactive Cys residue in the N- 
terminal region, and are capable of serving as a 
peroxidase and involve thioredoxin and/or 
glutathione as the electron donor [40]. 


Oxidative Stress- 
TOX 


NU 


scr gb- 
m11794_3 


58 


58 


Rattus norvegicus 
metallothionein-2 
and 

metallothionein-1 
genes [M11794]. 


Metallothionein (MT) is a small, cysteine-rich, metal- 
binding protein. MT synthesis is induced by various 
stimuli such as heavy metals, oxidative stress, 
anticancer drugs and fasting stress. MT is capable 
of not only reducing metal toxicity but also 
scavenging free radicals [39]. 


Oxidative Stress- 
TOX 


NU 


scr gb- 
af069306J 


59 


59 


Rattus norvegicus 

transaldolase 

[AF069306]. 


Transaldolase is a key enzyme of the reversible 
nonoxidative branch of the pentose phosphate 
pathway (PPP) that is responsible for the generation 
of NADPH to maintain glutathione at a reduced 
state (GSH) and, thus, to protect cellular integrity 
from reactive oxygen intermediates (ROIs) [41], 


Oxidative Stress- 
TOX 


NU 


scr gb- 
d17310_4 


60 


60 


Rattus norvegicus 
steroid 3-alpha- 
dehydrogenase 
[D17310]. 


Steroid 3-alpha-dehydrogenase is an important 
multifunctional oxidoreductase capable of 
metabolizing steroid hormones, polycyclic aromatic 
hydrocarbons, and prostaglandins. It is also 
required for bile acid synthesis and has been 
suggested to play an important role in net bile acid 
transport across the hepatocyte [42]. 


Oxidative Stress- 
TOX 


NC 


scr gb- 
bf281 368_2 


61 


61 


Rat gene 
fragment - 1086 
bp. 80% SI 
(754/938) to 
Human Prt1 
homolog 
[U62583]. 


PRT1 is a component of translation initiation factor 
elF-3 and originally discovered in Saccharomyces 
cerevisiae [43]. 


Protein 
Metabolism 


NU 


scr gb- 
u56407_3 


62 


62 


Rattus norvegicus 

ubiquitin 

conjugating 

enzyme 

[U56407J. 


Ubiquitin-conjugating enzymes (UBC) catalyze the 
covalent attachment of ubiquitin to target proteins 
and are distinguished by the presence of a UBC 
domain required for catalysis [44]. 


Protein 
Metabolism 



46 



NC 


scr gb- 
ai406674J 


63 


63 


Rat gene 

fragment - 796 bp. 
96% SI (634/660) 
Mus musculus 
heterogeneous 
nuclear 

ribonucleoprotein 
C, clone 
MGC:5715 
IMAGE:3499283 
[BC004706]. 


Heterogeneous nuclear ribonucleoprotein (hnRNP) 
complexes, the stmctures that contain 
heterogeneous nuclear RNA and its associated 
proteins, constitute one of the most abundant 
components of the eukaryotic nucleus. hnRNPs 
appear to play important roles in the processing, 
and possibly also in the transport, of mRNA [45]. 


RNA metabolism 


NC 


scr gb- 
bf290678_2 


64 


64 


Rat gene 

fragment -716 bp. 
84% SI (542/643) 
to Mus musculus 
heterogeneous 
nuclear 

ribonucleoprotein 
G, splice variant 1 
[AJ237847]. 


Heterogeneous nuclear ribonucleoprotein (hnRNP) 
complexes, the structures that contain 
heterogeneous nuclear RNA and its associated 
proteins, constitute one of the most abundant 
components of the eukaryotic nucleus. hnRNPs 
appear to play important roles in the processing, 
and possibly also in the transport, of mRNA [45]. 


RNA metabolism 


NC 


scr_gb- 

UI^OOJUO 1 


65 


65 


Rat gene 

fragment - 456 bp. 
94% SI (430/456) 
to Mus musculus 
cardiac lineage 
protein 1 (Clp1) 


Mouse Clp-1 is a potential cardiac transcriptional 

rfrm ilatrtrv farf f\r fulfil 


rxiNrA iviciauuiiDi 1 1 


NU 


scr gb- 
d86383_2 


66 


66 


Rattus norvegicus 
Hex [D86383]. 


Hex is a homeobox protein which is believed to 
function as a transcriptional repressor and may be 
involved in the differentiation and/or maintenance of 
the differentiated state in hepatocytes [47]. 


RNA metabolism 


NU 


scr sc- 

133366194 

1 


67 


67 


Rattus norvegicus 
thymosin beta-10 
[M58405]. 


Thymosin beta 10 is one of a small family of 
proteins closely related in sequence to thymosin 
beta 4, recently identified as an actin-sequestering 
protein [48]. 


Structural Repair- 
TOX 


NC 


cszr 230290 
139 182026 
368 


68 


68 


Rat gene 

fragment - 249 bp. 
91% SI (227/248) 
to Mus musculus 
sex-determination 
protein homolog 
Femla 
[AF064447]. 


FEM-1 is a signal-transducing regulator in the C. 
elegans sex-determination pathway. The existence 
of FEM-1 homologs in the mouse raises the 
possibility that evolutionary conservation of ancient 
FEM-1 signaling interactions may play a role in 
vertebrate cell-fate determination [49]. 


Other 


NU 


scr gb- 
ai01 3477_2 


69 


69 


Rattus norvegicus 
VL30 element 
[M91234]. 


VL30 elements constitute a family of 
retrotransposons that are associated with cancer by 
their overexpression in rodent malignancies, their 
induction in a fibroblast response to anoxia which 
shares features with the malignant phenotype, and 
their presence recombined into Harvey murine 
sarcoma virus (HaSV) and Kirsten murine sarcoma 
virus [50]. 


Other 


NU 


scr gb- 
m91235__3 


70 


70 


Rattus norvegicus 
VL30 element 
[M91235]. 


VL30 elements constitute a family of 
retrotransposons that are associated with cancer by 
their overexpression in rodent malignancies, their 
induction in a fibroblast response to anoxia which 
shares features with the malignant phenotype, and 
their presence recombined into Harvey murine 
sarcoma virus (HaSV) and Kirsten murine sarcoma 
virus [50]. 


Other 



47 



NU 


cszr 204152 
648 191521 
095 


71 


71 


Unknown, 63 bp. 




Novel 


NU 


cszr 204152 
792 "19151 7 
979 


72 


72 


Unknown, 133 bp. 




Novel 


NU 


cszr 204229 
614J91891 
yoo 


73 


73 


UnKnOWn, \4A Dp. 




INUVCl 


Ml 1 

NU 


cszr_204229 
bio 191892 
510 


74 


7 A 


Unknown, 124 bp. 




Novel 


Ml 1 

NU 


scr gb- 
aa801331J 


f 0 


75 


Unknown, 1252 

bp. 




Novel 


Ml 1 

NU 


scr_gb- 
aa899865_3 


76 


76 


unKnown, 1^41 
bp. 




Novel 


Ml 1 

NU 


scr gb- 
aa997629J 


77 


77 


Unknown, 396 bp. 




Novel 


Ml 1 

NU 


scr_gb- 
aa997691J 


fo 


78 


Unknown, 473 bp. 




Novel 


Ml 1 

NU 


scr_gb- 
ai411514_4 


70 

79 


79 


unKnown, MZ\ 
bp. 




Novel 


NU 


scr gb- 
aw142560_3 


80 


80 


Unknown, 695 bp. 




Novel 


NU 


scr go- 
aw533305_2 


81 


81 


Unknown, 771 bp. 




Novel 


Ml I 

NU 


scr gb- 
aw915573_2 


od 


82 


unKnown, i£o<L 
bp. 




Novel 


Ml 1 

NU 


scr gb- 
be108509J 


QO 
OO 


83 


Unknown, 422 bp. 




Novel 


Ml 1 

NU 


scr_gb- 
be111483J 


QA 
04 


84 


Unknown, 445 bp. 




Novel 


NU 


scr gb- 
be120910J 


85 


85 


Unknown, 482 bp. 




Novel 


NU 


scr gb- 
bf285287_1 


oc 
OO 


86 


Unknown, 784 bp. 




Novel 


Ml 1 

NU 


scr gb- 
bf390383J 


Q7 

Of 


87 


Unknown, 486 bp. 




Novel 


Ml 1 

NU 


scr gb- 
bf558463_2 


OO 


88 


Unknown, 921 bp. 




Novel 


Ml 1 

NU 


scrjgb- 
bf560709J 


09 


89 


Unknown, 525 bp. 




Novel 


NU 


scr gb- 
bg662990_2 


90 


90 


Unknown, 930 bp. 




Novel 


Ml 1 


scr gb- 
W278552J 


Q1 

a i 


91 


1 InUnrAA/n 4 HAH 
UIIM1UWII, lUOU 

bp. 




Novel 


NU 


scr gb- 
bi278749_1 


92 


92 


Unknown, 1158 
bp. 




Novel 


NU 


scr_gb- 
bi295938_1 


93 


93 


Unknown, 1241 
bp. 




Novel 



48 



NU 


scr gb- 
bi296376J 


94 


94 
34 


Unknown, 2695 
bp. 




Novel 


NU 


scr gb- 
bm384392J 


95 


95 


Unknown, 2423 
bp. 




Novel 


NU 


scr gb- 
bm387477J 


96 


96 


Unknown, 610 bp. 




Novel 


NU 


scr_gb- 
bm986259_1 


97 


97 


Unknown, 1047 
bp- 




Novel 


NU 


scr gb- 
s69874_5 


98 


98 


Unknown, 1191 
bp. 




Novel 


NU 


scr sc- 

119263563 

1 


99 


99 


Unknown, 384 bp. 




Novel 


NU 


scr sc- 

132556005 

1 


100 


100 


Unknown, 181 bp. 




Novel 


NU 


scr sc- 

132570828 

1 


101 


101 


Unknown, 130 bp. 




Novel 


NU 


scr sc- 

132947646 

1 


102 


102 


Unknown, 50 bp. 




Novel 


NU 


scr sc- 

133387221 

1 


103 


103 


Unknown, 296 bp. 




Novel 


NU 


scr sc- 

133555783 

1 


104 


104 


Unknown, 321 bp. 




Novel 


NU 


scr sc- 

133678871 

1 


105 


105 


Unknown, 92 bp. 




Novel 


NU 


scr sc- 

133725675_ 

1 


106 


106 


Unknown, 94 bp. 




Novel 


NU 


scr sc- 

133955481 

1 


107 


107 


Unknown, 343 bp. 




Novel 


NU 


scr sc- 

134521597 

1 


108 


108 


Unknown, 238 bp. 




Novel 


NU 


scr sc- 

172126480 

1 


109 


109 


Unknown, 247 bp. 




Novel 


NU 


scr sc- 

172130231_ 

1 


110 


110 


Unknown, 196 bp. 




Novel 


NU 


scr sc- 

1 7275501 0_ 

1 


111 


111 


Unknown, 457 bp. 




Novel 


NU 


scr sc- 

188295137 

1 


112 


112 


Unknown, 85 bp. 




Novel 


NU 


scr sc- 
190079504 


113 


113 


Unknown, 241 bp. 




Novel 



49 





1 












NU 


scr sc- 

191455923 

1 


114 


114 


Unknown, 388 bp. 




Novel 


NU 


scr sc- 

1 954601 51_ 

1 


115 


115 


Unknown 444 bo 




Novel 


NU 


scr_sc- 

198205946 

1 


116 


116 


Unknown, 135 bp. 




Novel 


NU 


scr sc- 
2573087J 


117 


117 


Unknown, 246 bp. 




Novel 


NU 


scr sc- 
2585074J 


118 


118 


Unknown, 203 bp. 




Novel 


NU 


scr sc- 
8571871_2 


119 


119 


Unknown, 233 bp. 




Novel 


NU 


scr sc- 
87f31837J 


120 


120 


Unknown, 300 bp. 




Novel 


NU 


scr sc- 
8786941 3 J 


121 


121 


Unknown, 351 bp. 




Novel 


NU 


scr __gb- 
ai233262_2 


122 


122 


Unknown, 889 bp. 




Novel 


NU 


cgrrs0h0310. 
9 13952-135 


123 


123 


Unknown 310 bD 




Novel 


NU 


scr gb- 
m13100.5_2 


124 


124 


Unknown, 100 bp. 




Novel 


NU 


scr sc- 

170396977 

1 


125 


125 


Unknown, 350 bp. 




Novel 


NU 


scr sc- 
14059147_2 


126 


126 


Rat gene patent 
WO0210453, 254 
bp- 




Unknown 


NU 


scr sc- 
87750810 J 


127 


127 


Rat gene patent 
WO0210453, 
1063 bp. 




Unknown 


NU 


cszr 202034 
260 190929 
676 


128 


128 


Rattus norvegicus 
Tclone4 [U30788]. 




Unknown 



Using the TOXMARKER gene information listed in Table 4, zone 3 necrosis-related 
genes expressed in vitro were confirmed. Confirmed genes are listed in Table 5. 



5 Table 5 



Gene ID 


«°l S io° Definition 
Number NO 


Human Ortholog 


SEQ ID NO: 


Amino Acid Metabolism 


cszr 96561134 
83760493 


35 39 Rattus 

norvegicus 
Carbamoyl- 


CPS1: Carbamyl phosphate synthetase I [D90282, 
NM_001875] 


129 



50 





phosphate 

synthase 

[ammonia] 

(CPSASE I), 

mitochondrial 

precursor 

[P07756]. 






Carbohydrate Metabolism 


scr gb- 
W277612J 


38 38 

Rat gene 
fragment - 1381 
bp. 89% SI 
(816/910) to Mus 
musculus for 
beta- 

hexosaminidase 
[Y00964]. 


NM_000521 

>rsna.KfcrotUnUMAiNUINA- 

ID:NM_000521 |acc:NM_000521 
fgeneName-'HEXB" /definition="Homo sapiens 
hexosaminidase B (beta polypeptide) (HEXB), 
mRNA." /proteinjd= n NPJ)00512.r 
forganism="Homo sapiens" /CDS= ,, 76..1746 ,, 1 1857 
bp. 

>gb:GENBANK-ID:AF378118|acc:AF378118.1 
Homo sapiens cervical cancer proto-oncogene 7 
mRNA, complete cds - Homo sapiens, 1892 bp. 


130 


cszr 229800465 
190907286 


1 29 131 Rat non-neuronal 
enolase (NNE) 
(alpha-alpha 
enolase, 2- 
phospho-D- 
glycerate 
hydrolase) 
[X02610J. 






Energy Metabolism 


scr_gb-j05266_3 


39 .39 

Rattus 
norvegicus 
mitochondrial H+- 
ATP synthase 
alpha subunit 
[J05266], 


NM 004046 

>rshd:REFSEQHUMANDNA- 
!D:NM_004046|acc:NMJ)04046 
f'geneName= ,, ATP5A1 "7definition="Homo sapiens 
ATP synthase, H+ transporting, mitochondrial F1 
complex, alpha subunit, isoform 1 , cardiac muscle 
(ATP5A1), mRNA." /protein Jd= n NP_004037.1" 
torganism="Homo sapiens" /CDS="912..2573 H , 
2725 bp. 


132 


Cell Cycle Regulation (Regulation Of Proliferation) 


scr gb~ 
ab015747_3 


43 43 Rat interleukin-4 
receptor 
(membrane- 
bound 

form)(AB015747. 
1:99%/3501, 
p=0.000000), 
3520 bp. 


X52425.1 Human IL-4-R mRNA for the interleukin 4 
receptor 


133 


scr gb- 
ai233262_2 


122 122 Rat gene 

fragment, 889 bp, 
93% identical 
over 679 bp to 
mouse RANBP4 
[AF123388]. 


weakly similar to PUTATIVE IMPORTIN BETA-4 
SUBUNIT [AK027871, NM_024658]. 


134 


scr gb- 
W294409J 


42 42 Rat gene 

fragment, 526 bp, 
89% SI (327/365) 
to mouse type I 
interferon 
receptor, IFNaR2 


Ifnar2: interferon receptor ifnar2-1 [L41944, 
NM_000874] 


135 



51 





subunit, 1109 bp 
(Y09813). 






scr_gb-m37394_5 


40 40 Rattus 

norvegicus 
epidermal growth 
factor receptor 
(Egfr) [M37394]. 


EGFR: epidermal growth factor receptor [K03193, 
NM_005228]. 


136 


scr_gb-m64300_4 


41 41 Rattus 

norvegicus 
extracellular 
signal-related 
kinase (ERK2} 
[M64300]. 


MAPK1 : Mitogen-activated protein kinase 1 
[BC017832, NM_002745]. 


137 


scr sc- 

191 879433 J 


44 44 Rat Crk- 
associated 
substrate, p130 
[D29766]. 


BCAR1: breast cancer anti-estrogen resistance 1 
[AK026121, NMJJ14567]. 


138 


Cellular Communication 


scr gb- 
m13100.5_2 


124 


124 


Rattus 

norvegicus gene 
for histamine N- 
methyl transferase 
, exon 1 and 2 
[AB007833J. 






scr_gb-x87157_5 


46 46 Rattus 

norvegicus 
neurotensin 
endopeptidase 
[X87157]. 


NLN: neurolysin [AJ300837, NM_020726]. 


139 


Detoxification Response / Biotransformation-TOX 


scr gb- 
af017393_2 


48 48 Rattus 

norvegicus 

cytochrome 

P4502F4 

(CYP4502F4) 

[AF017393]. 


CYP2F1: cytochrome P450, subfamily IIF, 
polypeptide 1[J02906, NMJ500774]. 


140 


scr sc- 
134241 980 J 


49 49 Rat cytochrome 
P450IIB3 
(P450IIB 
subfamily) 
mRNA, complete 
cds (M20406.1: 
100%/279, 
p=5.0e-156), 367 
bp. 


MSRA: methionine sulfoxide reductase A 
[AJ242973, NM 012331]. 


141 


DNA Metabolism 


scr gb- 
bi296376_1 


94 94 Rat ribosomal 
DNA external 
transcribed 
spacer 1 (ETS1) 
[X16321], Rat 
45S rDNA gene 
transcription 
initiation region 
[X00677J, and 
Rat ribosomal 







52 





RNA1.6 small 
subunit (SS1.6) 
gene, 3' end 
[M30843] - 2695 
bp. 






Immunity And Defense 


scr_gb-x17037_2 


51 


51 


Rat OX40 antigen 
[X17037J. 


TNFRSF4: tumor necrosis factor receptor 
superfamily, member 4 
[AW293499,S76792,X75962 I NM_003327]. 


142 


cgrrs0h0310.9 13 
952-135 


123 


123 


Rat lipocortin-V 
mRNA, complete 
cds (M21730.1: 
99%/1419, 
p=0.000000), 
1744 bp. 






Intracellular Transport 


scr gb- 
aj000696_5 


53 


53 


Rattus 

norvegicus novel 
kinesin-related 
protein, KIF1D 
[AJ000696J. 


KIF1C: kinesin family member 1C [AB014606, 
NMJ06612]. 


143 


scr_gb-d79221_3 


54 


54 


Rattus 

norvegicus r-s!y1 
[U35364]. 


Vesicle transport-related protein [AF1 10646, 
NMJJ16163]. 


144 


Oxidative Stress-TOX 


cszr 229602935 
183895355 


56 


56 


Rat 

metal!othionein-i 
(mt-1) 






m61937 


55 


55 


Rattus 

norvegicus 

dihydrodiol 

dehydrogenase 

[M61937]. 






scr_gb- 
af069306_1 


59 


59 


Rattus 
norvegicus 
transaldolase 
[AF069306]. 


Clone MGC:2838 I MAGE: 2966784 [BC001517]. 


145 


scr gb- 
af 106944,3 


57 


57 


Rattus 
norvegicus 
Peroxiredoxin (II 
[AF106944]. 






scr_gb-d17310_4 


60 


60 


Rattus 
norvegicus 
steroid 3-a!pha- 
dehydrogenase 
[D17310]. 


MTCH2: mitochondrial carrier homolog 2 
[AF085361,NMJ)14342] 


146 


scr_gb-m11794_3 


58 


58 


Rattus 
norvegicus 
metallothionein-2 
and 

metallothionein-1 
genes [M 11 794]. 


MT2A: metallothionein 2A [S52379, NM_005953]. 


147 


Protein Metabolism 



53 



scr gb- 
bf281368_2 


61 


61 


Rat gene 
fragment - 1086 
bp. 80% SI 
(754/938) to 
Human Prt1 
homolog 
[U62583]. 


elF3: eukaryotic translation initiation factor 3 
[U78525, NM_003751] 
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RNA metabolism 


scr gb- 
ai406674_1 


63 


63 


Rat gene 
fragment, 796 bp. 
91% SI to human 
heterogeneous 
nuclear 

ribonucleoprotein 
C(C1/C2), 
[XMJ 66936] 


HNRPC: heterogeneous nuclear ribonucleoprotein 
C [BC003394, NMJJ04500]. 
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scr gb- 
bi288503_1 


65 


65 


Rat gene 
fragment 456 bp. 
94% SI (430/456 
) to Mus 

musculus cardiac 
lineage protein 1 
(Clp1) 

(AY090614.1) 


HEX1M1 protein [AB021179, NM_006460]. 


150 


scr_gb-d86383_2 


66 


66 


Rattus 

norvegicus Hex 
[D86383]. 


HHEX: hematopoietically expressed homeobox 
[BC015110.X67235, NM_002729]. 
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Membrane Transport / Transporters 


scr sc- 
8571871_2 


119 


119 


Rattus 

norvegicus Na/Pi 
cotransporter 4 
[AB025224.1]. 


U90545.1|HSU90545 Human sodium phosphate 
transporter (NPT4) mRNA, complete cds 


152 


Other 


scr gb- 
af311311_2 


130 


153 


Rattus 
norvegicus 
P116RIPmRNA, 
complete cds 
(AF311311.1: 
99%/3285, 
p=0.000000), 
3905 bp. 






scr gb- 
ai013477_2 


69 


69 


Rattus 

norvegicus VL30 

element 

[M91234]. 






scr gb- 
bm986259_1 


97 


97 


Rat gene 
fragment, 1047 
bp, 86% SI 
(647/748) to 
human CGl-126 
protein mRNA, 
1091 bp 
(AF151884). 


HSPC155: hypothetical protein HSPC155 
[AF161504, NM_016406] 


154 


cszr 230290139 
182026368 


131 


155 


Similar to Mus 
musculus, 
feminization 1 a 
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homolog (C. 
elegans), clone 

IVIOw.OOUcJ 

IMAGE:2811079, 
mRNA, complete 
cds 

(BC009161.1: 
91%/244, p=3.4e- 
085), 249 bp. 






scr_gb-m91 235J3 


70 


70 


Rattus 

norvegicus VL30 

element 

[M91235J. 






Novel 


scr gb- 
aa801331_1 


75 


75 


Unknown, 1252 
bp. 






scr_gb- 
aa899865_3 


76 


76 


Unknown, 1241 
bp. 






scr gb- 
ai411514_4 


79 


79 


Unknown, 1221 
bp. 


KIAA0652 gene product [BC006191, NM_014741]. 


156 


scr gb- 
bf558463_2 


88 


88 


Rat gene 
fragment, 921 bp. 


Clone IMAGE:4052356, partial cds. [BC014348]. 


157 


scr gb- 
bi278552_1 


91 


91 


Unknown, 1060 
bp- 


KIAA0427 gene product [AB007887, NMJ)14772]. 


158 


scr gb- 
bi295938_1 


83 


83 


Unknown, 1241 
bp. 


Hypothetical protein FU13409 [BC015897, 
NM_024617]. 


159 


scr gb- 
bm384392_1 


85 


85 


Unknown, 2423 
bp. 


Hypothetical protein MGC3067 [BC002457, 
NMJJ24295]. 


160 


scr_gb-s69874_5 


98 


98 


Unknown, 1191 
bp- 


COL4A1: collagen, type IV, alpha 1 [AH002741, 
NM_001845]. 


161 


scr sc- 
132947646 J 


102 


102 


Rat gene 
fragment, 50 bp. 






scr sc- 
1721 26480 J 


109 


109 


Unknown, 247 
bp. 






scr sc- 
1882951 37 J 


112 


112 


Unknown Gene, 
85 bp. 






scr sc- 

191 455923 J 


114 


114 


Unknown, 388 
bp- 


Heterogeneous nuclear ribonucleoprotein M 
[NMJ505968]. 


162 


scr sc- 
1954601 51 J 


115 


115 


Unknown, 444 
bp. 






Unknown 


cszr 204152648 
191521095 


71 


71 


Ul-R-A1-ek-e-09- 
O-Ul.sl UI-R-A1 
Rattus 

norvegicus cDNA 
clone UI-R-A1- 
ek-e-09-0-UI 3', 
63 bp. 


XM 114110 

>rshd:REFSEQHUMANDNA- 

ID:XMJ 141 10|acc:XM_1 141 1 0 

fgeneName= n LOC200081 " /definition-'Homo 

sapiens similar to muscle-derived protein MDP77 

isoform 2 (LOC200081), mRNA." 

^protein id= n XP 114110.1" /organism="Homo 

sapiens" /CDS ="94.. 873", 3962 bp. 


163 


cszr 204152792 
191517979 


72 


72 


Rat gene 
fragment, 131 bp, 
99% SI (130/131) 
to mouse IMAGE 


AB032968.1 Homo sapiens mRNA for KIAA1142 
protein, partial cds 
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clone 

(BC030389), 
2072 bp. 






scr gb- 
bf390383J 


87 87 UI-R-CA1-bcg-d- 
03-0-UI.S1 Ul-R- 
CA1 Rattus 
norvegicus cDNA 
clone UI-R-CA1- 
bcg-d-03-0-UI 3', 
486 bp. 
[BF390383.1]. 






scr gb- 
bf560709J 


89 89 UI-R-C1-kd-h-12- 
0-Ul.r2 UI-R-C1 
Rattus 

norvegicus cDNA 
clone UI-R-C1- 
kd-h-12-0-UI 5\ 
mRNA 

sequence(BF560 
709.1: 99%/464, 
p=9.2e-264), 525 
bp. 


CTNNA2: Catenin (cadherin-associated protein), 
alpha 2 [M94151, NMJ3043891. 
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scr sc- 
14059147_2 


126 126 Rat gene 

fragment - 254bp. 
75% SI (190/252) 
to Homo sapiens 
mRNA for 
putative 
progesterone 
binding protein 
[AJ002030]. 


>gb:GENBANK-ID:HSAJ2030|acc:AJ002030.1 
Homo sapiens mRNA for putative progesterone 
binding protein - Homo sapiens, 1874 bp. 


166 


scr sc- 
1721 30231 J 


110 110 Rat gene 

fragment* 196 
bp. 92% SI 
(181/196) to Mus 
musculus cDNA 
clone 

IMAGE:5051929 
5'[BI146266.1]. 


AK056165.1| Homo sapiens cDNA FLJ31603 fis, 
clone NT2RI2002654 


167 



Example 6: Genes Involved in Zone 3 Necrosis in vivo 

There is ample evidence to support the importance of ATP depletion as a mechanism that 
leads to cell necrosis [3]. Rat trihydroxycoprostanoyl-CoA oxidase was selected as an in vivo 
5 marker for zone 3 necrosis and was found to be downregulated in the dataset. This enzyme is 
one of the three acyl-CoA oxidases found in rat liver peroxisomes and is responsible for the beta- 
oxidation of fatty acids [16]. Downregulation of an enzyme involved in beta-oxidation 
represents an interruption in a pathway that can lead to ATP production. Consistent with this is 
the downregulation of an enzyme involved in amino acid metabolism. Betaine-homocysteine 
10 methyltransferase (BHMT) catalyzes the transfer of an N-methyl group from betaine to 
homocysteine to form dimethylglycine and methionine, respectively [14]. Interestingly, 
alterations in methionine metabolism have been known to be associated with human liver 
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cirrhosis for many years [15]. As discussed earlier, severe necrosis is involved in the 
pathogenesis of cirrhosis [9]. Thus, necrosis itself can lead to an inflammation mediated injury. 
A marker involved in an inflammation and defense response was found to be upregulated in this 
in vivo marker set. Rat MHC-associated invariant chain gamma stabilizes MHC class II 
5 molecules that are at the surface of antigen presenting cells [26]. Rat MHC-associated invariant 
chain gamma is predicted to be a secreted protein based upon PSORT, SIGNALP, and 
Hydropathic Profile (HP) analysis 

Rat ephrin type-B receptor 1 precursor (EphBl) was downregulated as compared to controls. 
Upon binding to its ligand, EphBl has been shown to activate c-Jun kinase through recruitment 

10 of an intermediate protein Nek [18]. c-Jun is an oncogene involved in cell proliferation. 
Regulation of proliferation is an integral part of necrotic cell death, whether it results from 
compensatory liver regeneration of healthy cells or cell cycle arrest of unhealthy cells. Indeed, 
ribosomal protein S6 was selected as a marker and was found to be upregulated. It has been 
known for 20 years that the ribosomal protein S6 is quickly phosphorylated when cells are 

1 5 stimulated to grow or divide [19]. Phosphorylation of S6 occurs in response to mitogens by 
activation of one or more protein kinase cascades, including MAP kinases [20]. Rat annexin II 
belongs to a family of proteins that in the presence of Ca 2+ bind to acidic phospholipids. They 
may also cross-link plasma membrane phospholipids with actin and the cytoskeleton, and 
possibly play a part in exocytosis, since they are also involved in granule aggregation and 

20 membrane fusion [28] . Annexin II was found to be an upregulated marker and may be 

indicative of loss of structural integrity within the cell. Though annexin I and annexin II have 
also been identified as major substrates for the tyrosine kinase activity associated with epidermal 
growth factor receptor (Egfr). Rat Egfr signals through a number of pathways, including the 
MAP kinase pathway, to regulate proliferation [35]. 

25 Canalicular multispecific organic anion transporter (cMOAT), an upregulated gene in this in 
vivo marker set, has been shown to arbitrate the hepatobiliary elimination of many organic ions 
[29]. It has also been shown to remove glutathione conjugates from cells [30]. Glutathione 
conjugation is a cellular adaptation to the generation of reactive oxygen species (ROS) [36]. 
ROS occurs when oxidative phosphorylation becomes uncoupled during ATP generation. When 

30 molecular oxygen is reduced in such a manner damaging amounts of (V, H2O2 and OH are 
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formed in the process. ROS are thought to participate in necrosis through their reaction with all 
forms of biological macromolecules including lipids, proteins, nucleic acids and carbohydrates 
[37-39]. 

Markers were chosen from other biochemical pathways as well. This includes the 
5 upregulation of rat heat shock protein 86 (hsp86) which may be indicative of cellular stress. Rat 
ras-related protein (Rab-2), a protein involved in intracellular transport, was also upregulated. 
Two protease inhibitors, rat homologue to human inter-alpha-trypsin inhibitor heavy chain 
(ITIH1) and rat homologue to mouse serine proteinase inhibitor mBM2A were downregulated 
and upregulated, respectively. Human ITI is found in human serum and is predicted to be a 
10 secreted protein based upon PSORT, SIGNALP, and HP analysis [32]. Finally, numerous genes 
involved in xenobiotic metabolism were diminished after exposure to the zone 3 necrotic agents. 
However, two, rat epoxide hydrolase and rat aflatoxin Bl aldehyde reductase, were found to be 
upregulated. 

There were 1 1 markers chosen for this PTS marker set that did not match any known 
1 5 genes in the database and have novel composition. One gene, rat osteoactivin has no known 
association to any of the histopathologically relevant biochemical or toxicological pathways but 
is predicted to be a secreted protein based upon PSORT, SIGNALP, and HP analysis. 

Example 7: Genes Involved in Zone 3 Necrosis in vitro 

Several of the biochemical events consistent with necrosis are represented in the in vitro 
20 marker set that is predictive of zone 3 necrosis. There is ample evidence to support the 

importance of ATP depletion as a mechanism that leads to cell necrosis [3]. Included in this is 
documentation that the production of ATP via glycolysis can protect a cell from necrosis when 
oxidative phosphorylation is inhibited [51-53]. The alpha subunit of rat ATP synthase was found 
to be downregulated in the in vitro zone 3 necrosis marker set. ATP synthase is the final enzyme 
25 in the electron transport chain and is ultimately responsible for catalyzing the synthesis of ATP. 
Downregulation of such a key enzyme is indicative of loss of ATP within the cell. Rat 
nonneuronal enolase is another marker that was found to be downregulated. Enolase is a vital 
enzyme in the glycolysis pathway that converts glucose to pyruvate. Glucose is a preferred 
carbon source and generated the highest return of ATP per unit of expended energy. 
30 Downregulation of enolase may represent a depletion of glucose stores within the cell. The rat 
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homologue to mouse beta-hexosaminidase is a protein involved in oligosaccharide and 
glycosaminoglycan degradation and was found to be upregulated in this marker set [20]. 
Upregulation of this marker may represent the cells attempt to maintain glucose supplies. The rat 
homologue of this gene was found to be a secreted protein based on protein based upon PSORT, 
5 SIGNALP, and HP analysis. Two markers related to ATP depletion were found to be 

upregulated in this marker set. Rat EP3 alpha receptors for prostaglandin has been found to be 
involved in the inhibition of adenylyl cyclase, which catalyzes the conversion of ATP to cAMP 
[18]. Inhibition of this process would be consistent with a lack of ATP within the cell. 
Carbamoylphosphate synthase is a mitochondrial protein that removes excess ammonia in the 
10 cell via the urea acid cycle. Upregulation of this rat enzyme may indicate a resort to utilize 
amino acids as a source of energy. 

Uncoupling of electron transport during oxidative phosphorylation in the above process 
can lead to the formation of excessive amounts of Reactive Oxygen Species (ROS). When 
molecular oxygen is reduced in such a manner damaging amounts of O2-, H2O2 and OH are 

1 5 formed in the process. ROS are thought to participate in necrosis through their reaction with all 
forms of biological macromolecules including lipids, proteins, nucleic acids and carbohydrates 
[54-56]. Cells have adapted to the generation ROS through an elaborate antioxidant defense 
system. Two such mechanisms of defense are found to be upregulated in this marker set. A rat 
metallothionein isoform represents one of these mechanisms. Metallothionein is a small 

20 cysteine-rich metal binding protein that mediates heavy metal response and can play a role in ion 
homeostasis has the ability to scavenge free radicals and has been found to be induced under 
oxidative stress conditions [39]. Transaldolase is a key enzyme in the nonoxidative branch of the 
pentose phosphate pathway that can reduce the amount of reactive oxygen intermediates though 
the maintenance of glutathione at a reduced state [41]. Glutathione is critical for scavenging 

25 mitochondrial ROS through glutathione reductase and peroxidase systems. Interestingly rat 
peroxiredoxin HI, a member of a novel family of anti-oxidative proteins, was found to be 
downregulated in this marker set. Peroxiredoxins have the ability to reduce H2O2 by using 
thioredoxin or glutathione as an electron donor [40], Downregulation of this protein may 
represent a preference to remove H2O2 through one of the other defense mechanisms available to 

30 the cell. Similarly rat metallothionein 1, another isoform of metallothionein, was downregulated. 
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The fact that there are two rat metallothionein isoforms found in this marker set modulated in 
opposite directions may also indicate a preference of one form over the other or may represent a 
redundancy in the pathway. Dihydrodiol dehydrogenase is a marker, which was found 
downregulated and may play a role in the amount of ROS generated in the cell This enzyme is 
5 believed to yield ROS upon detoxification of polycyclic aromatic hydrocarbons [38]. Thus 
downregulation of this process may be an attempt to limit the overall amount of ROS within the 
cell. 

As discussed earlier, hepatic regeneration is a response to cellular necrosis. This process 
involves re-entry of surviving liver cells into the cell cycle to replace lost tissue mass [57], 

10 Though this normal reaction to liver injury can, if uncontrolled, lead to the early onset of hepatic 
carcinogenesis. Several markers predictive of in vitro zone 3 necrosis were found to be involved 
in the regulation of cellular proliferation in the cell. These markers were consistently 
upregulated and are represented by growth factor receptors (rat Egfr), cytokine receptors (rat IL- 
4r & a rat gene homologue to Mus musculus fciar-2 receptor), MAPK signaling cascades (rat 

1 5 Erk2), as well as a gene involved in the regulation of protein kinase C activity (rat homologue to 
human DGK-delta). Rat epidermal growth factor receptor signals through a number of 
pathways, including the MAP kinase pathway, to regulate proliferation. However, under certain 
conditions stimulation of this pathway can lead to cell growth arrest and the induction of 
apoptosis [22]. Interestingly rat extracellular signal-related kinase 2 (Erk2) was also found to be 

20 upregulated. Erk2 is a member of the Raf/MEK/ERK signaling pathway that was the first MAP 
kinase cascade to be characterized [23]. Rat interleukin-4 receptor (IL-4r) is an upregulated 
marker for in vitro zone 3 necrosis. While EL-4 is a cytokine that has immunomodulatory effects, 
there is evidence that IL-4 interaction with its receptor can lead to such downstream effects as 
gene activation and cellular proliferation [26]. A rat homologue to mouse soluble isoform 

25 precursor type I interferon receptor (Ifhar-2) represents the upregulation of another cytokine 

receptor. Recent data shows that murine Ifhar-2 is an effective regulator of interferon responses 
[24]. It is known that type I interferons play a role in cell proliferation [25]. Murine Ifhar-2 
soluble form is predicted to be a secreted protein based upon PSORT, SIGNALP, and 
Hydropathic Profile (HP) analysis. A rat homologue to human diacylglycerol kinase, delta 

30 (DGK-delta) was found to be upregulated in this marker set. DGK is a signal transduction 
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enzyme that mediates protein kinase C activity by modulating intracellular concentrations of two 
signaling lipids, diacylglycerol and phosphatidic acid [29]. Protein kinase C is a family of serine- 
threonine kinases that is known to regulate proliferation and apoptosis [30]. The only 
downregulated marker involved in regulation of cell proliferation was rat Crk-associated 
5 substrate (Cas) pi 30, a unique docking protein with a Src homology 3 (SH3) domain. Tyrosine 
phosphorylation of Cas has been implicated in integrin mediated activities including cell 
proliferation and survival [27]. Additionally, Cas, upon interaction with Src, has been shown to 
be involved in a H2O2 activation of cJun NH(2) terminal kinase (Jnk) pathway [28]. 

Some markers may have a less clear association with a necrosis specific pathway. This 

10 includes the upregulation of rat VL30 element, a retrotransposon that has been found to be 
upregulated in rodent malignancies but which a specific role has not been identified [50]. Rat 
OX40 antigen was found to be downregulated. OX40 is a member of the tumor necrosis factor 
family that is expressed by activated T lymphocytes and may indicate the presence of 
inflammatory events [34]. Also relevant to an immunological response is the upregulation rat 

1 5 homologue to Homo sapiens IgG Fc binding protein. This protein is widely expressed on 

mucosal surfaces and in external secretions [35]. The rat homologue to human IgG Fc binding 
protein is predicted to be a secreted protein based upon PSORT, SIGNALP, and HP analysis. 
This marker set includes two markers involved with protein metabolism. This includes the 
upregulation of rat ubiquitin-conjugating enzyme (UBC), which catalyzes the covalent 

20 attachment of ubiquitin to a target protein. The ubiquitin/proteasome pathway is the main non- 
lysosomal route for intracellular protein degradation in eukaryotes. It is important to many cell 
processes including cell-cycle progression and more recently has been found to target regulatory 
molecules found in the apoptotic cell death pathway [44]. Another protein that is involved with 
protein metabolism and was found to be downregulated is a rat homologue to human Prtl. Little 

25 is known of this protein, except that it a component of the translation initiation factor elf-3 [43]. 
Rat thymosin beta- 10 is a marker that was downregulated. It is a protein that is believed to be 
involved in the sequestering of actin and may be indicative of loss of structural integrity of the 
cell [48]. 

Markers were chosen from several other biochemical pathways as well. A couple of 
30 genes involved in xenobiotic metabolism were induced after exposure to the zone 3 necrotic 
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agents. These are rat cytochrome P-450's 2F4 and 2B3, which were both upregulated. Several 
genes that are involved with the synthesis or transport of RNA were also chosen as markers. 
These include the upregulation of the rat homologue to mouse heterogeneous ribonucleoprotein 
C, rat homologue to mouse cardiac lineage protein 1 and rat Hex. The rat homologue to mouse 
5 heterogeneous ribonucleoprotein G was also chosen as a marker but was found to be 

downregulated. Two markers involved in the intracellular trafficking were also selected. This 
includes the upregulation of a rat novel related kinesin protein which may be involved in the 
transport of membrane bound organelles and the downregulation of rat r-slyl which plays a role 
in ER to Golgi trafficking [36, 37]. Two markers in this set are involved with cellular 

10 communication. Rat neurotensin endopeptidase was downregulated. Neurotensin is a hormonal 
peptide that functions as a central neurotransmitter or neuromodulator as well as a peripheral 
hormone [20]. Rat densin-180 has a strong association with the postsynaptic density in CNS 
neurons and is believed to function as a synaptic adhesion molecule. Densin-180 was found to 
be upregulated in this in vitro marker set [31]. Rat homologue to mouse sex-determination 

1 5 protein homologue Fern 1 a was upregulated in this marker set but has no known association to 
any of the histopatho logically relevant biochemical or toxicological pathways. Feml is a known 
signal transducing regulator in the C. elegans sex-determination pathway [49]. 

There were 55 markers chosen for this PTS marker set that did not match any known 
genes in the database and have novel composition. There were also two genes that blasted to rat 
20 gene patents and have novel utility. One additional marker, rat Tclone4, had some similarity to a 
known gene but has no known function. 



EXAMPLE 8: Prediction of the Toxicity of a Test Compound 

The following example describes the application of the TOXMARKER expression 
25 profiles generated, as described above, to identify hepatotoxic compounds. 
Hepatocvte Culture 

Animals. Male Wistar Han rats (Crl: WI[Glx/BRL/Han]IGS BR) were obtained from 
Charles River Laboratories, Inc. (Raleigh, NQ. The animals were housed for 6 or 7 days in a 
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temperature-, humidity-, light-controlled facility prior to hepatocyte preparation and were at this 
time 200-250 g in weight. 

Hepatocyte isolation Hepatocytes were prepared by in situ liver perfusion according 
to a protocol used at the Yale Liver Center (Yale University School of Medicine, Yale 
5 University, New Haven, CT). To minimize the risk of contamination all equipment and solutions 
used during the perfusion procedure were autoclaved. The animals were anesthesized by sodium 
phenobarbital (approx. 50 mg/kg) and the abdomen opened to expose the liver. To perform the 
perfusion a catheter was attached to vena porta and secured by a ligature. After disrupting vena 
cava inferior 37 °C Hanks A buffer (120 mM NaCl, 5 mM KC1, 0.4 mM KH2P04, 25 mM 

1 0 NaHC03, 0.5 mM EGTA, 0. 1 % glucose) was circulated through the liver a 40 ml/min for 1 0 
min. The perfusion was continued with 37 °C Hanks B buffer (120 mM NaCl, 5 mM KC1, 0.4 
mM KH2P04, 25 mM NaHCO30.4 mM MgS04, 0.5 mM MgC12, 3 mM CaC12, 0.1% glucose) 
supplemented with collagenase (Liberase Blendzyme 3, 120mg/400 ml; Roche Diagnostics 
Corp., Indianapolis, IN) until clear signs of liver disintegration were visible (approx. 10 min). 

15 The liver was transfered into ice-cold Leibovitz L-15 media (Invitrogen, Carlsbad, CA) and the 
organ was disrupted mechanically with forceps. The cell suspension was filtered through a 80 \im 
mesh followed by a 45 \im mesh, and the medium replaced twice with fresh ice-cold L-15 
medium following 5 min low speed centrifugations (30G). Cell were transported on ice to 
Curagen's tissue culture facility and washed twice as above in William's E media (Sigma, St. 

20 Louis, MO) with supplements (10% fetal bovine serum (Gemini, Woodland, CA); 9.6 ug/ml 
prednisolon, glucagon 0.014 ug/ml, insulin 0.16 units/ml, glutamin and antibiotic-antimycotic 
solution (all purchased from Sigma, St. Louis, MO). Cells were counted and viability was 
measured in a hemacytometer after Trypan blue staining 

Hepatocyte culturing Cells were cultured in 12 or 24 well tissue culture plates coated 
25 with rat tail collagen (Becton-Dickinson, Bedford, MA) according to a protocol provided by Dr. 
Grazyna Wasinska-Kempka at Bayer (Wuppertal, Germany). Cells were seeded at a density of 
approximately 80,000 cells/cm 2 or 270,000 cells/well in 12 well plates and 140,000 cells/well in 
24 well plates and incubated at 37 °C, 5% CO2 for 2 h. To suppress contamination the amount of 
antibiotic-mycotic solution was increased (3-fold) and gentamicin (44 ug/ml; Invitrogen, 
30 Carlsbad,) was added during the seeding stage. After 2 h the media was replaced with fresh 
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William's E media (as above) supplemented with 75 ug/ml rat tail collagen (Becton-Dickinson). 
The media was replaced approximately 16h later with fresh collagen-containing media, with or 
without test compound, and thereafter every 24 h 

Test compounds Test compounds were solubilized at 200-fold final 
concentration in 100% DMSO and diluted into hepatocyte culture media to final concentrations 
ranging from approximately 200 pM to 10 mM depending on the compound. Compound stock 
solutions in DMSO were prepared at the initiation of the dosing regimen and stored at 4 °C. 
Compounds were added to the culture media immediately prior to addition to the cells. 

Control compounds (compounds that do not elicit a toxic histopathogy, see for 
example Table 1 and 2)) are selected from the training set and dosed on each batch of 
hepatocytes along with the unknown compounds subjected to toxicity prediction. The data from 
these controls are used to determine hepatocyte quality over time 

RNA isolation Qiagen's (Valencia, CA) RNeasy 96 kit was used for isolation of 
RNA. For cell lysis the culture media was carefully removed and 400 ul/sample in 12 well plates 
and 200 ul/sample in 24 well plates of RLT lysis buffer supplemented with 10 mM DTT was 
added per well. The RLT buffer efficiently lysed cells and solubilized the collagen layer covering 
the cells. The lysates were homogenized by pipetting 12-15 times, snap frozen and stored at -80 
°C. 

Cell viability In parallel with the compound dosing, a viability assay was 
performed to make sure that the concentrations used to treat the hepatocytes were not excessively 
toxic to the cells. Cell viability was monitored for each compound concentration using the 
CellTiter Assay (Promega, Madison, WI), a modified MTT assay. Cells were seeded in 96 well 
plates coated with rat-tail collagen (Becton-Dickinson) at a density of 27,000 cells/well. After 16 
h in culture cells were treated in triplicate for each compound and concentration. Following 24, 
48 or 72 h incubation in the presence of the drug, the MTS/PMS reagent of the CellTiter kit was 
diluted 1/6 in culture medium, added to the cells and after 60 min incubation of the cells optical 
density at 490 nm was measured using a PowerWaveX Select 96 well spectrophotometer (Bio- 
Tek Instruments, Winooski, VT). Viability was calculated in relation to no-drug control after 
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subtraction of a no-cell background value. Only compound concentrations that showed 70% or 
more viability in this assay were used for gene expression profiling. 

RNA Purification The hepatocytes are harvested in 200 |il of lysis buffer (RLT) 
provided with the Qiagen RNA isolation kit. Total RNA is isolated from the lysates using the 
5 Qiagen RNeasy 96® isolation kit following the manufacture's instructions with some 

modification. An equal volume of 70% ethanol is added to each of the lysates and the samples 
are added to the membrane on the 96 well plate. Membranes are then washed thoroughly (once 
with 800 \x\ of buffer RW1 and twice with 800 jal buffer RPE) to remove unbound material 
followed by DNase I treatment (50 units of DNase I from Promega (10 u/jil) in buffer RDD from 

10 Qiagen; total volume 60 ul) for 1 hr at room temperature to remove all traces of genomic DNA 
that might be co-purifying with the RNA. Following DNase I treatment, the membranes are again 
washed three times as before (once with 800 jal of buffer RW1 and twice with 800 \x\ buffer 
RPE), and dried with a centrifugation step (6000xg for 7 min; to remove all residual traces of 
ethanol from the washing buffers). RNA is then subsequently eluted from the columns with 40 \x\ 

1 5 of RNase/DNase-free water. This process has been automated using the Tecan Genesis Freedom 
robotic system. 

Quantity of RNA is determined by fluorometry using Ribogreen dye from 
Molecular Probes and quantified using a fluorometer (Spectrafluor Plus instrument, Tecan). This 
procedure involves diluting each 2jil sample ten-fold in Rnase-free water and then measuring 

20 fluorescence (after addition of dye). An average of triplicate subsamples is used to calculate the 
concentration and total RNA yield for each sample (by comparison to a standard curve generated 
from known amount of RNA standards). At this point the samples are evaluated as passed or 
failed based on a concentration criteria (60 ng/^1 or more considered as "PASS"). The samples 
that have a concentration greater than 60 ng/|il are further diluted to 60 ng/|xl with DEPC treated 

25 water. 

cDNA Synthesis Double stranded cDNA is synthesized using the Roche cDNA 
synthesis kit, following the manufacturer's instructions, with some modifications. 600 ng of total 
RNA isolated from the hepatocytes (60 ng/ul), are spiked with 2 \xl of reference mRNA (7xl0 6 
copies of hyaB and 2.5xl0 7 copies of mhpR) and this mix is incubated in the presence of 2 ug of 



65 



oligo [(dT) 2 4 T7prom] 6 5 primer at 70°C for 10 min, immediately followed by quick chilling on 
ice. To each sample first strand synthesis mix is added such that the final mix contains IX AMV 
RT buffer, 8.1 mM DTT, 25 units of AMV reverse transcriptase, 12.5 units of RNase-inhibitor 
and a dNTP-mix (1 mM of each nucleotide). This mix is incubated at 42°C for 1 hour followed 
5 by chilling on ice. The second strand synthesis involves the addition of the second strand buffer 
to a final concentration of 1 X, a dNTP mix (80 \xM each) and the second strand enzyme blend 
provided in the Roche cDNA synthesis kit. The mix is incubated at 16°C for 2 hours. Adding 10 
units of T4 DNA polymerase to each reaction and incubating at 16 °C for a further 5 minutes to 
terminate the elongation. The reactions are stopped by adding 1 1 \i\ of 0.2 M EDTA pH 8.0. The 
10 double stranded cDNA is purified using Qiagen's QIAquick™ Multiwell PCR Purification kit, 
following the manufacturer's instructions. The cDNA is then quantified by fluorometry using the 
Picogreen® dsDNA Quantification Kit (Molecular Probes) following manufacturers instructions. 

Microarravs 

15 

In Vitro Transcription, cRNA Purification, and cRNA Quantitation The complete yield 
of double stranded cDNA (minimum 50 ng) is placed in a Centrivap Concentrator (Labconco) for 
2 hours at 45 °C or until liquid is completely evaporated. In vitro transcription is performed using 
Ambion's MEGAscript™ T7 Kit, following the manufacturer's instructions, with the following 

20 modifications. Biotin labels are incorporated during cRNA synthesis by adding biotin 14-CTP 
(Invitrogen) and biotin 16-UTP (Roche Applied Science) to the in vitro transcription reaction to a 
final concentration of 1.5 mM. Transcription reactions are incubated at 37°C during 16 to 18 
hours. Reactions are stopped by adding 2 U of DNase 1, and incubating at 37°C for 15 minutes. 
cRNA is purified using Qiagen's RNeasy 96™ kit, following the manufacturer's instructions. 

25 Purified labeled cRNA is recovered in 30 ul of DEPC-treated water (see STM TS-MAH-104) . 
Labeled cRNA yield and quality are determined by measuring the 260/280 nm optical density 
ratio, using a Powerwave HT spectrophotometer (Bio-Tek). Labeled cRNA is then diluted to 
0.225 ug/ul using DEPC-treated water. 
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Hybridization, and Fragmentation We use a format with two microarrays on each slide 
and dual hybridization chambers consisting of two 0.8 mm height, 22 mm 2 square chambers. 
Dual hybridization chambers are installed on glass arrays using a Slide-Chamber Alignment Tool 
(SCAT). The SCAT is linked to a vacuum pump to create pressure that makes the chamber and 
5 the microarray hold together. Once assembled, microarrays are incubated at 37°C for 10 minutes. 
Slides are then placed chamber side down on a lint free paper towel (Texwipe Company LLC) on 
a flat surface, and using a finger, gentle pressure is applied along the periphery of the chamber. 

Synthetic cRNAs (1 ng each of bacterial araA and ybiW) are combined with 9 ug 
hepatocyte-derived labeled cRNA prior to fragmentation. The cRNA mixtures are fragmented at 

10 94°C for 20 minutes in a 50 ul reaction containing 40 mM Tris-Acetate, 100 raM Potassium- 
Acetate, 31.5 mM Magnesium- Acetate, pH 8.1. After the fragmentation is complete, 300 ul of 
cold hybridization buffer (Mergen Ltd., proprietary composition) is added to the fragmented 
RNA, and kept on ice until ready for loading onto a microarray, 320 ul of cRNA are then loaded 
onto a barcoded microarray. After air bubbles have been removed from the liquid, loading ports 

15 of the chamber are tightly sealed using Mergen adhesive dots. Slides are incubated for 16-18 
hours at 30°C in a hybridization rotisserie oven set at 18 rpm (Robbins Scientific, model 400) 

Microarray Washing and Staining After hybridization for 16 hours, the hybridization 
chambers are removed from the microarrays slowly using forceps. The microarrays are placed in 
a reservoir containing TNT (0.1 M Tris-HCl, pH 7.6, 0.15 M NaCl; 0.05% Tween-20) and 

20 incubated at 39°C for 1 hour. The following staining procedure is light sensitive so all 

incubations are done under foil taking care to minimize light exposure. Following the TNT 
wash, the arrays are incubated in Buffer B (Mergen LTD, proprietary composition) at 4°C with 
gentle agitation for 30 minutes. The slides are then placed in a 1 :500 dilution of 1 mg/ml 
streptavidin-alexa 647 (Molecular Probes) in Buffer B at 4°C with gentle shaking for 30 minutes. 

25 Following a TNT wash of three times 5 minutes each, the arrays are incubated in a 1 :500 dilution 
of 0.5 mg/ml biotinylated anti-streptavidin antibody (Vector Labs) in Buffer B at 4°C with gentle 
agitation for 30 minutes. The microarrays are washed three times in TNT for 5 minutes each. A 
1 : 1 mix of streptavidin-alexa 647 and biotinylated anti-streptavidin antibody is incubated at 25°C 
for 1 hour to encourage complex formation. This complex is diluted 1 :267 in Buffer B and 
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incubated with the microarrays for 30 minutes at 4°C with gentle shaking. The slides are washed 
again in TNT three times for 5 minutes each followed by 2 washes of 5 seconds each in 
redistilled H 2 0. The arrays are washed individually in redistilled H 2 0 five times for 1 second 
each. The microarrays are placed in a slide holder and centrifuged for 15 minutes at 300 rpm to 
5 ensure complete dryness. The slides are placed in a clean dry box and stored at room 
temperature until being scanned. 

Scanning and Analysis of Microarrays Each slide is individually scanned using a 
GenePix 4000B scanner (Axon Instruments Inc.), using the 630 nm laser at 100% power and a 
PMT setting of 600 volts. 

1 0 Image QC and Data Export Tiff images from each slide are analyzed using the GenePix 

4.0 software (Axon Instruments Inc.). The scanned image is first aligned with a grid consisting of 
an array of circular features, such that each spot on the image is contained within a feature. After 
the array is roughly aligned, the software conducts fine alignment of each feature with each spot 
on the array. Additionally the software calculates local median foreground and local median 

1 5 background for each feature. The microarray is then evaluated manually to determine whether the 
features found by the software were true data points and not false intensity due to a defect or 
contamination. If contamination affects more than 30% of a feature, the feature is flagged as bad. 
If contamination affects more than 30% of the local background area of a feature and the local 
foreground/local background intensity ratio is less than 3, the feature is flagged as bad. After 

20 flagging is complete, a GenePix output file is generated. The output file contains spot IDs, spot 
location information, median local foreground, median local background, and bad spot flag 
information. 

A quality control log is kept for each array to document background levels, 
noise/contamination problems as well as the number of features flagged. A final Image QCpass 
25 or fail decision is made for each array and added to the log. An array must contain no more then 
1% flagged spots in order to pass Image QC. If an array contains an area of contamination of 
>150 counts which covers more then 5% of the array then the array fails 

PTS Microarray Data Processing The GenePix output files generated during the Image 
QC and Data Export are processed using Microsoft excel. A macro is used to split the data from 
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each of the two arrays, calculate spot intensities, and filter out low quality data. The following 
procedure is used to calculate gene intensity and filter the data. 

A. Calculate spot intensity 

spotjntensity = median local foreground - median local background 
B, Calculate threshold 

There are 15 probes on the array that are either yeast or mouse negative controls. 

If (spotjntensity is from probe type "yeast jieg" or spot_intensity is from probe_ 
type "mousejieg") 

then spoMntensity = thres_value 

meanjhres = average (thres_value[l], thres_value[n]) 
stdevjhres = stdev (thres_value[l], thres_value[n]) 

If (thres_value < (mean_thres + 3* stdevjhres) and thres_value > (mean_thres - 
3*stdev_thres) 

then thres_value = good_thres_value 

else thres_value = bad_thres_value 

mean_good_thres = average (good_thres_value[l], good_thres_value[n]) 
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if mean_good_thres > 0 

then threshold = mean_good_thres + 5* stdev (localJ)g_median[l], 
local_bg_median[n]) 

else threshold = 5* stdev (local_bg_median[l], local_bg_median[n]) 
C. Data filtration 

If (spotjntensity has no problem spot flag then spot_intensity = good_ spotjntensity 



Only data of type good_spot_intensity is passed to the Discovery department. 

PTS Microarray Quantitative QC The array quality is assessed by calculating a set of 
descriptive statistics and testing if they pass set criteria. In order for array data to be accepted the 
following criteria must be met: 

• Toxicity marker spot intensity trim mean / threshold > 8 

• Maximum spot intensity of blank spots < 300 

• Number of marker spots above threshold > 800 

• Labeling spike mean spot intensity/ threshold > 50 

• Hybridization spike mean spot intensity >50 



Toxicity Prediction 
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The PTS presents a single overall likelihood of toxicity for unknown samples (i.e. the 
probability that a sample is toxic) that can be ranked to indicate severity of toxic insult. In order 
to compute the toxicity likelihood of an unknown we calculate a likelihood estimate from 3 
different modeling types (Classification Trees, Discriminant Analysis, and Logistic Regression) 
using three independent gene lists for a total of 9 models per mode. These 9 models are then 
averaged to provide a likelihood (probability) of toxicity for each mode. The rationale behind 
combining the results of several models as opposed to relying on a single model that performs 
best for a given mode is to control for the risk of over- fit (a model that performs well on training 
data but inadequately on novel samples). The best performing model would be expected to be 
more over-fit than a weaker performer. In order to reconcile this we average the results of 
several models in order to determine the true likelihood of toxicity for an unknown. This section 
attempts to describe the methods we will use for model averaging, explain how compounds can 
be ranked, and how we can draw inferences about the severity of toxic insult. 



Common Terms: 

Likelihood of Toxicity : A value between 0 and 1 indicating how confident we are 
that a given compound/dose combination is toxic. 

Model: A statistical algorithm for prediction. This section focuses on Logistic 
Regression, Discriminant Analysis, and classification trees which are explained elsewhere 
(Appendix H). 

Model Type: Logistic Regression, Discriminant Analysis, or classification trees 

Mode: A specific type of hepatotoxicity (e.g. hypertrophy) 

Present and Absent: Present means a compound produced a given toxicity in vivo 
absent means it did not. 

Model Background 

One obvious problem with model averaging is that different models provide 
different outputs. Discriminant Analysis produces a number without bounds, logistic regression 
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produce a likelihood estimate with a value of 0 to 1, and classification trees a proportion of node 
impurity with a value between 0 and 1 . Therefore an arithmetic mean of these three results may 
be misleading because the scale of discriminant analysis is so different than the other models. 
Our approach is to scale the results of all models to produce a single likelihood, P p 9 the 
5 probability that this sample belongs to the toxicity class as opposed to P 4 , the probability that the 
sample does not manifest itself as toxic for this mode. By definition: 

Equation 1: Pf +Pf =1 

Where j is the j th model (the three models mentioned above). In order to proceed we first 
need to find P p for each model. 

10 

Equation 1 means that our classifications (present and absent) are mutually exclusive and 
collectively exhaustive. If a sample is absent for hypertrophy, it cannot also be present for 
hypertrophy (mutually exclusive) and if a sample is not hypertrophic it must be absent of 
hypertrophy (collectively exhaustive). 

1 5 Logistic Regression: This model returns by design. 

Discriminant Analysis: This model returns a linear discriminant that is a one- 
dimensional linear combination that establishes two separated normal distributions as follows: 

20 

where 'Absent' refers to the theoretical distribution of samples that did NOT produce 
pathology and 'Present' refers to the theoretical distribution of samples that did produce 
pathology. Let Mp = the mean of the linear discriminant function for the training samples 
annotated as present, and M A = the mean of the linear discriminant function for the training 
25 samples annotated as absent. Let Gp and Ga denote the probability mass function for the 
Gaussian distributions of the linear discriminant values for the training samples annotated as 
present and absent, respectively. Having estimated these functions, calculations of percentiles is 
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very straightforward. In the above figure, M P > M A , therefore, for an unknown sample with a 
linear discriminant value of c x\ we have: 



Equation!: P' = Xom^^ 



[UpperTail(G A (x)) + LowerTail(G p (jc))] 



where Lower Tail G P (x) refers to the area under Gp(y) for which y < x, and Upper Tail 
(G A (x)) refers to the area under G A (z) where z>x. When M A >Mp the above equation becomes: 



„ . „ „ p UpperTailiG Jx)) 

Equation 3: P p =■ yF v M " 



[LowerTail(G A (x))+UpperTail(G p (x))] 
10 



Equations 2 and 3 appear more complicated than they really are. They simply convert the 
percentile of (x) belonging to the "present" distribution to a conditional probability that it 
belongs to present and not absent. This is used to satisfy mutual exclusivity rule of equation 1. 
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Classification Trees. As mentioned above, classification trees return a probability of 
correct classification for each prediction. However, this probability is actually a proportion based 
on the node impurity of the classification tree (the fraction of training samples on that leaf that 

20 belong to the majority class, e.g. a leaf contains 9 samples with hypertrophy and 1 sample 

without, the probability returned is 0.9). This is not a good indication of the true probability of 
an unknown because the leaf may have very few members and because the confidences of branch 
decisions are not included in this calculation. In order to convert this proportion to likelihood, 
we simply construct many trees using a subset (n-1 , where n= the number of compounds) of the 

25 compounds in the reference database until all compounds are NOT used once (this is identical to 
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the leave one out cross validation described in Appendix H and in the next section). This process 
is called a jack-knife estimate of confidence. 



Summary: 

This section describes how we convert the results of the three different models to 
a likelihood estimate that satisfies the mutual exclusivity rule of equation 1. For logistic 
regression this is the result, for discriminant analysis we rely on the probability mass function of 
a normal distribution, and for classification trees we create a jack-knife estimate of node 
impurity. Having described these techniques the next section explains how we combine the 
results of the individual models. 

Model Averaging 



Each of the above models makes a decision as to whether a toxicity is "present" or 
"absent", and some models are expected to perform better than others. What remains to be 
1 5 explained is how we determine model quality and how we combine the results of individual 
models. 

a) Model Quality: 

In order to determine how confident a given model is in its decision, we perform 
jack-knife estimates of each prediction. A jack-knife estimate computes n different models, with 
20 n-1 compounds in the training set (where n is the number of compounds used for training a 
particular toxicity mode). Each jack-knife casts a single vote for absent or present. The best 
models confidently (likelihood is much greater or less than 0.5) make the same decision 
consistently, while poorer performing models tend to have equivalent "present" and "absent" 
votes. 

25 b) Combining Models: 

The number of present and absent votes are tallied across all the jack knife 
estimates for each unknown compound. Models that are more confident in there decision are 
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naturally weighted heavier by an ability to cast more votes. The vote totals can then be evaluated 
using the binomial distribution as follows: 



P 

L Tor = S 



f N ^ 



V 1 J 



Tox ~ uo 2 N 



Where L Tox is the likelihood that the observed vote distribution is greater than 0.5, P is 
the number of votes for "present" and N is the total number of votes cast. 

Summary: 

This section explains how we use three different marker sets for each model and then 
10 compute a weighted average based on how consistent the predictions are within a given step. At 
the completion of this exercise each sample will have three sets of votes: one each for logistic 
regression, discriminant analysis, and classification trees. The likelihood for each mode is 
calculated from a binomial distribution, under the null hypothesis that voting is random. 

15 EXAMPLE 9: TOXMARKER Nucleic Acid Sequences 

This example provides exemplary TOXMARKER nucleic acid sequences, useful 
in methods of screening compounds for hepatoxicity according to the invention. 

Table 9 A 

>scr_gb-af038870_4 (TOXMARKER Assignment: 1; SEQ ID NO: 1) 
20 ttttttttttttttttttttgaaggttttcaaccggcatgtttttattaatgaaatggaa 
tggaagcagtcagaacagagattacagaattacagaatggatcagttatctgttaagttt 
tacagggctggtgtgtgttgtttctgcctaagggtcctgctcaaaagatcttggaatcca 
cttgggaagcatcttagatatagatggttgctgtgtcacttatgatacggtccctgaatg 
gttctatgtcactcgtggaggtggtgtcctatccccctatctgaaatgagattgacgtcg 
25 ggtgactttctcttcgctgcagtgactcctgtgcgcctgtaatgcgacaggcacgtagga 
aatgtgttcaggatttactgtggacttctcctttcttccttctaggtaaaattctaaagc 
gtagttttgtaactgtgaaatgctatctgtgactccattttgtctaactagcaccaatca 
caggtgtaagccggcatcaacacaaacgctggtttagagatgccttctccttccgggtgc 
acactgtggcccggacctggaggaattcgccccgaaccgctggcctgtggctactgtgcg 
30 gatttgaatttttgtttttcgaagagcgctctcagctgctgctcagtggtggcttccttc 
tgctgcatcagctctgctgcccctttcgtcactccccaagcatccggcttggacatcgaa 
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ggattgtacggtctgccggaagctattcgaagattctgccagtattctttcctggccctt 
gccctgatccagggtttggtgtgcatgtccaaaccacttccccagctgccatgtttttct 
gaagctggtggtaaaaatcccctttctggggcgagctcctctgcaatggccctgatgtgg 
tagggctcaaatccgcagcagccgccaatgtacctgacccccaggttgtaggcctctctg 
gcgtatttttgaatatcccatctggtggcaactctgggttccaatccaaaggggaattct 
gggagatcaataaatccctgtttgccacagtcaggggtgtggtaggccaggggctggctc 
atcaagtaagccttcagccgagctgcttccagaccctccttcatgagctttattgtctgc 
aagctggtgctggggtcgaagtggcagttcacaccgacaatggcggcacctgcttttacc 
aaacgcactgcgcactctccaggagacacgccatgtagatctccttcaggtccgatgcac 
atggtagccgctataggcttcccggatgtttttaaggcctcgactgcccacacggcttct 
tcaacatgttcaaaatactctgcaatgaggaagtccacattcttcttcatgaagacctca 
agctgttggtgaaatatctttttaacttccgtctcactcttgcagctgaggtaggaaggt 
gtctgactcacacctcctgcaaccaatgcatccccttcgtcagcaacttgccgtgcaatg 
tcacaagcagcttcattgaccttctgcccagatatcttctctgccacgtagttccctcgg 
ttttccagcttgtcctcacttgcatagaaagtgaaggtctgcatgacgttcgatccagct 
ctgaggaactcccgatgaagctgccgaactgcctcggggtgctccaccgcagcctctggg 
gtccagggtccagcctttacgtagcccctcttttccagtgcaaagacaaatcccccatct 
ccgatcacgacttcgccagcatttaagcgttctaagattcccctcttggccttcttgccg 
gcaatcggtgccatctttccggtgtcctgagtggcgctgaacgcagctgcggactggaca 
ggagcggtctccagcaaaggcttgactgctgagccgcttctggcctctttatatacagca 
gctaggattccccagccttgaccgggtccaacacatggcctcaggcggggaacacgccca 
ccagcctttgaaacaggcctggggctagctgggaattc 

>scr__gb-z83053_3 (TOXMARKER Assignment: 2; SEQ ID NO: 2) 

gacatggcaccagccggaggcccacgagtcaagaagggtatcttggagcgtctggacagc 

ggggaggttgtggttggggacggcggctttctcttcactctggaaaagagaggctttgtg 

aaggcaggactttggactccagaagcagtggtagagtatccaagtgcagttcgtcagctt 

cacacagaattcttgagagcgggagccgatgtcttgcagacattcaccttttcggctgct 

gaagacagaatggaaagcaagtgggaagctgtgaatgcagctgcctgtgacctggcccag 

gaggtggctgatggaggggctgctttggtggcagggggcatctgccagacatcactgtac 

aagtaccacaaggatgaaactagaattaaaaacattttccgactacagctaggtgttttt 

gccaggaaaaatgtggacttcttgattgcagagtattttgagcatgtggaagaagccgtg 

tgggctgtggaagtcttgagagaggtgggggcacctgtggctgtgaccatgtgcatcggc 

ccagagggggacatgcacggcgtgacaccgggagagtgtgcggtgagactgtctcgtgca 

ggggcgaacatcattggggtaaactgccggtttgggcctggaccagcttacaggaccatg 

agctcatgaaggagggcctcagggattgcggcctactagctcaccttatggtccagtgct 

tgggttttctcacactgggactgtggcaagggagggttgtggacttcctgatatcctttt 

cgcctggggcaagagttgccaccagatgggatattcaaaaatacgccagagaggcctaca 

acctgggggtcaggtacattggcggctgctgcggatttgagccctaccacatcaggggcc 

attgcagaggagctcgccccagaaaggggatttttgccaccagcttcagaaaaacatggc 

atctggggaagtggtttggacatgcacaccaaaccctggatcagagcaagggctagacgg 

gaatactgggaaactctgttgccagcttcgggaagacctttctgtccttccctatcaaag 

ccagatgcttgagaagccatgaaagagacctctgaagtgacagaaaggaggaaacagcct 

caagccccatctggaatcttcctggctgctgtcctcagcccgttcttctggctgttgagc 

atcgatgagctgtcgtcccttccaattgagtgacatatcactcctgagtatgcccactag 

atgcggtggagatgcagaggcatccggaccccacgccccaccccctcccctcacacactt 

actctctgcctagtaatgccacagagcttccatccccatccaaaggtcatcaggcatggc 

tatcagttggctctcagggtggatttgacattctcagatgattagaagttggcaagaagc 

aaccttggtgaataactctggtgtctaaactctgtacttgagttacagtctcagtagagg 

agacgccaaagctgttgcgagtgacggcagaattattgaacagtcatgatgcttggcttt 

caaaggcgattatcgctttaaggtcttagaattagtaagtgcatctttataaccaggcat 

agctagatcataaactactgatggccaaggaccatagaacgtgcttcttaccttcctctc 

tagttagcattacgacaaacataatcaccaacgctcagggaaacacttgctgattcaagt 

aaaatgcatgaaccttggaagacctttctagaagtcagagatcaagttcatcttgttcta 

gcactttccacattcatgtttggtttgtatgctgcgccctacttttgttttttgctacaa 

tgtaacaaattagtgagtaaccattagtgaaattgcgaataattttccttttctaaattt 

tgatttctttggaacattgatttaaaaaaaatagtgtgttgcttgtcaaaaaaaaaaaaa 

aaaa 

>scr_gb-x95189_4 (TOXMARKER Assignment: 3; SEQ ID NO: 3) 
ccatagcgaagacttcatgaagactgtcccaggcatgctgtgacacaaactacagaaggt 
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gggaaaagatctttgtggtcaaaccatccggaccttggctaccgcagacagaacaatact 
gaccgcattcactcatacacagttctcggcacctcccagtgctcagagcagaccctcaag 
gagatgagcagatccaggatggggagcccaatgcaccgagtgtccctgggggacacctgg 
agctggcaagtgcacccggacatagacagcgaaaggcactcaccgtccttcagtgtggag 
cgactcaccaacatccttgatggaggcctcccaaacaccgtgctgcgaagaaaagtcgaa 
agcatcatacaaagtgacccagtgtttaatttgaagaagctttacttcatgacccgagag 
gagctatatgaggatgcgattcaaaagagattccatctcgagaagctagcctggagcctg 
ggctggtcagaagatggtcctgaacgcatttatgctaacagagtccttgatggaaacgtc 
aacttaagcttacatggtgttgccatgaatgctatccgaagcctgggctcagatgaacag 
attgctaaatggggccaactctgcaaaaacttccaaatcatcacaacatacgcccagaca 
gagctgggacacgggacatacctacagggcctggagactgaagccacctatgatgaagcc 
aggcaggagcttgtgatacacagccctacgatgacttccaccaagtggtggcctggggac 
ttgggatggtcggtcacccatgctgtggtcctagcccagttgacctgcttaggagtccgg 
cacggcatgcacgccttcattgtgcccattcggagcctagaggatcacaccccactgcca 
ggaatcacagttggggacataggccccaagatgggtttggaacacatagacaatggcttc 
ctgcaactgaaccacgtgcgggttcccagagaaaacatgctcagtcgctttgcagaggtc 
ttgccagatggtacctaccagaggcttgggacgccacagagcaattatcttggcatgttg 
gtgacccgggtgcagctgctgtgtaaaggaatcctaccctccctccagaaggcttgcatc 
attgccacgcgctactcagtaatccgccatcagtctcgacttcggcccagtgacccagag 
gcaaaaatcctggaataccagacgcagcagcagaaactccttcctcagcttgctgtgagc 
tatgccttccacttcacggccaccagcctctcagaattcttccacagctcctacagtgct 
attctgaagagagacttcagcctcctgcctgagctccatgcattgagcactggtatgaag 
gccacgtttgcagacttctgtgcccagggcgccgagatctgtcgcagagcttgcgggggc 
catggctactcaaagctgagcggcctgccgacactggttgctcgagcaacagcctcttgc 
acatatgagggtgagaatacggtgctctacctgcaagtggccaggtttctgatgaagagc 
tatctgcaggctcaagcgtccccaggcgccacaccacagaagcctctccctcagtccgtc 
atgtatattgccacacaaaggccagccaggtgctcagcccagactgcagctgacttccgc 
tgccca^atgtctataccacagcctgggcatatgtgtctaccaggctcataagagatgca 
gcacaccgtacacagaccctcatgaagtccggggttgaccagcatgatgcctggaatcaa 
actactgtcatccaccttcaggctgctaaggctcactgctacttcatcactgtgaagaat 
ttcaaggaagctgtggagaaactagacaaggaaccagagattcagcgtgtgctccaacgc 
ctctgtgacctctatgccttacacggtgttctgactaactcaggggactttctgcatgat 
ggcttcctgtctggggcccaggtggacatggccagagaagccttcctagacctgcttccc 
ttgatccggaaggatgccatcttgttaaccgatgcttttgacttctcggaccattgttta 
aactcggcacttggctgttatgatggacacgtctacgaacgcctgtttgagtgggctcag 
aagtacccagccaatactcaggagaaccctgcctataagaagtatatccgaccactgatg 
ctcggctggagacacaagatgtgaaaagtcaaaggatttgggaccgagaagcaccacggc 
cttactatggcacatatacatagagaatttaaagcacggggggggggggggggggggtgc 
tgctcggttaaatcaggtagtaaattggtacatgaatggatggtcatcctattagtctac 
tattgagcatgtttgaaactttcccttgtccatctatagcatgtatttggctaaatgcta 
aaatttttgttttacatacaggaaaagctaataaacttgtcagttacaaa 

>scr_gb-m59814_4 (TOXMARKER Assignment: 4; SEQ ID NO: 4) 

tttttttttttttttttttttttttttttttttttttttttaacaatgagacatatacag 

ctttatttaacctgtaaaaagtcacactctgcagagtgacacctttcttatctcagcaga 

aagcaaggagtgtgtgaaaaaccttttcctcaggttgggaaccgtatgaccctggctggg 

ctcacatgtggatccttccagagtccttgtgtgtggcagcttcttcccagaggtctccct 

ggctggtgtgacccctcaccaacaacagacaggggggcaaaatatttctacctggacaag 

gctgccctgagattgtccctttccctcctattaagggacattacatgcttaagaccttcc 

cagaaaagtcaccttcaaggtgacttggctttcatcatgtctgctgacacttaggctcca 

cttatttaccatgatggtgtgtgctaacggtccttcctcttccaataacctcaccatcga 

tggcattttaaatatcactctgttctctgggaccgagggatggagaaccgctctccctca 

gaccaggttttgactcaggagctgggttttattttgaagaaacttccctacatgagtcat 

gagcaagggaaatggatgtgggggagggaggaggggctctgagggaggagtacgaatgga 

ggaaagaaaagaatgtcattggcgagggagagcatggcacagcccagggcttccctctct 

tccctccacctccttcctttcttcctgcagacggggaactccagtccctctcagatggga 

actgagttcaccctggttcccaacgcatacggtttcagcttcgcttctgtttagcatcac 

ctttctctgtctttatcgtcaatcattacgcgtttggtttcccacggcttctacacactt 

ccatggccgagaaatggcggttgcccatgggcagcaggtccagttcattcttcacaggtg 

ggaagttgtttctcagccaagaagctgatctttctggcacattccaccgtggtcaacctc 

tgtttcccctttgaccctggtccttttcattcctctcctcccctaggaacatcgagttct 
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catgccattaccgacggtgactggttcatctggaccctcatcgagtggatgctgctaaga 
atcttcttctgatggcctgccaaggtgacccctattctcaggaggtcttctgatgtcatc 
tgggtgaccagctggagggaggtgaagccagcggtgaggaagctgtccctgtactggacc 
attttgatggcacttagccagtcatccacggtggtaaaggccgtgaagtctgggatagag 
5 cggtcaagcaggggttgggaaggcacagcggtgatggttgccacagtcttgagactagct 
gggttccggatcatcttgtccagggtgttgacgatctctgcaaaacggggccggctattt 
cgatccttctgccaacagtccagcatgagctggtgcagggcagctgggcagtccatagga 
gggggcagccggtagtcctgctcaatggcattgatgacatcttgattggacatatcccag 
taaggtctctctccaaatgacattacttcccacatgacaatcccgtagctccagacatcg 

10 ctggctgacgtaaacttgcggtaggcgatggcctctggagctgtccatctaacaggtatc 
ttccctcccaaggagctggtgtaggtggggtctgaggtgtcatcctggaggtagcgagag 
aggccaaagtcagacactttgcacaccaggttgctgttcaccagaatgttcctagcagcc 
aggtcccggtgcacataattcatctcagataggtacttcatgccagcagcgatgcccctc 
agcatccccacaagctggatcacggtgaactgtccgtcattttgccggaggaaagagtct 

15 aaagcgccattctccatgaactccgtaatgatcatgacaggtcggctcttggtgacaaca 
ccctctaggcgaatgatgttgggatggtcaaactggcccatgatgctcgcctcgctcaga 
aaatcccgacgctgtttctctgagtacccagctttcagggtcttgatggccacatagatt 
tccctcttgcctggcagcttcaatcggcccttgtacacttctccaaactcccctgctccg 
atgacctcttcaattttcacaaaagacacatcaatctccttggcaaactcccggacagct 

20 tcattagggtcctcataagtgaacgggtcaatgtagatcttcatccctggggagcctcgg 
cctgtgctgtaatgctgaagtttatcactgtacacagcctctttgctgtaagctcgtttc 
ctgctgcagacaatggagatagccaccagagacacaacaaatacaaccccagctgctgca 
gagccagcgatcaggggtagctgctctctcagctccgacttgtaatcatcatctgtcaga 
gtctggaagcacatcttgccactgaacttgccatagccagccacggttcgagctcgtacc 

25 tggaccacatacaccatgccgggccgtagcccatcgatacgtgccgtgttggtctggctc 
ctggccatggaagagttgaactcattgtgctccttctcatagtaccggatctcatagtcc 
aggatgatgccattaggctgctccggctgaggccatgacaaggtgatgctcctcatggtg 
gcactgacctggtgcatgataggaacagtggagggggcagcttggtttgtggtgatgttg 
acagagacatgctgtggggggaagggactcttgctagagactccattgatggcctggata 

30 tcaaaagtgtatggggtgtgggcccataggctactgatagagacacgacactcagtcaag 
cccagctgtctgggtacaaactccacattgtcatcgcagcgggagcaactccggcggtct 
gctctgcacttcttgcagatgatgttgtaggtcacatcatctcgcccaccggtctctctt 
ggagggtgccactctagaatgatagatgtctcattcacaatggagatgacatttcgaggg 
cctgatgggacactagtgcacgccacttctgggggatcaaagtctgctcggtaatagcca 

35 gtccggcaggtgcagatgggagacgcctctgaaggggagcggctgttggaggggcagtgg 
gagcagccttcagcttcctggctggccttgaaggttcccgcaggacaggccttgcaggcc 
acgctgttctcaggttcatagccagccttacaggtgcagcgcccaatgggcaccatccac 
tctccatctccattgcagtagagttttatgggcacatccacttcttctgcattagggatg 
catgtgccccgagcaatcaccagagatgtgctctctgctcctgtcatggtttctgggaac 

40 actgcaaaattttgcacaatgctgggacactttttgaagaagacacggacagaaagtaga 
gacatacaggctccataatcctggaaagcgaggtaaaaaccattcctagtaagaggccca 
aagctcctgacttctgtgttgaccttcatcaaccttcccccaaaatccacctgggagaag 
ctctcatctgcagcaatggtgtcaactttgaggtagggggcttcagaccagaaggctgac 
ttcttggtggcaatgacagagtcagtctcatagtagtataagttgaaggtctctttgcag 

45 gagcctgggacatttggaaggctgctgcagtccctcacagtgaagcgcatctctgtatag 
atgcgatgggcgccccgtctgttgataaaggtggtaagcagccagttgttctggttgggt 
tcaaagacgttgcacacttggtaagtacggatggtgttcaggttttcatcgtagccactg 
acttcttcccacccagaggcagggttggccgtccatcccaactctgcagtggcagtcctt 
gtgtccatcaatgtttcttccatcgcggccactgcagatgccaggaggaacagcagcagg 

50 caatccagggccatcgccggccagcggcccccaggccgagccccagcggagacgcgccgc 
gtcccagggcgccgctgcgctcccggcgggtggcttctccgtgtcctttcgcgctctggc 
cgggaccggactccccggagcgcggcgtgggcgtgggcgggagtgtgcgcgcgtggggcg 
gtgcgggcgcgcgtggatgtgggtgtgcatgtgtgtgtgtgtgtttatgggagaggtggg 
tgtgtgcgtgcgtgtgtgagagagggtgagggagagcgagccaaaccataaaaagatgga 

55 gggggagttgtgggtgggcgaccctgctagtttcatagctggcattcttggggctggaaa 
ccccatggcacaagacgttaggatggctggtctgctcaaccactgtgccgtgtgtgaggg 
gtctctcggcttgtgtctctatcctgctctcattgagtcggatgacctgtacagctctgt 
ctaccatggaggatgtattgtgaagtctctgtgctaaggactcacgtttgggtgctttgg 
agatgaaatggatgacatgtacactggatatccccctcgtg 

60 

>scr_gb-m29358_5 (TOXMARKER Assignment: 5; SEQ ID NO: 5) 
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ccccccctcgaggtgttttctttcatttcattccttgtctttagggcttttttttttttc 
aaggtctcattatttatttgttactctttaaagacttatttttgactggactcagattta 
gaagtagaagctctcagcgaagacagcctacgtctcttggcaatctgttcctggcgcttc 
tctttggcttccttcattctcttggccaaaagtttagcatattctgcagcctcctccttg 
tttttcttagtgcgttgcttcttcagagcaatacgtcggcgtttgtgttgcaggacacgg 
ggagtaacaagacgctgaatcttgggcgctttggtcctgggcttcttaccttctttgttt 
aagggctttctgacaacatactggcggacatcatcttctttggagagattaaaaagcttt 
cggattctactagctcttttaggtcccaaccgacgaggcacagtggtatctgtcagtcct 
ggaatatccttctctcctttttttacaataaccaagttgagaacactcaggttggcatcc 
acaatgcatcctcggacagacttgcgcttcctctctccagttctcctaggtctataacaa 
gaatgccccttactcaaaagcaggcgcactctgccatgggtcaaaacgccttgcttcatg 
ggaaaaccttgtttgtcattcccaccgctgatccggaccacataacccttccactcttca 
ccaagagcatcagcagctacttctgtggccatgcgcttctcatagaacgtacgaagcttg 
cgttcgtcatccacttctatgagtttctgacagccagtggcagggaaggagatattcagc 
ttcatcttgacacagccgaccgcctaggaggcgtgttaccattctgatgttggagcggcc 
gc 

>aj 297736 (TOXMARKER Assignment: 6; SEQ ID NO: 6) 

agttgcttcagtgtcccggtgcggttagtcacgtttcgtgcgtgctcattctgccaagat 
gcctgaggaaacccagacccaagaccaaccaatggaggaagaggaggtcgaaacctttgc 
ctttcaggcagaaattgcccagttaatgtccttgatcatcaacactttctactcgaacaa 
agagatctttctgagggagctcatttccaactcctcagacgctctggataagatcagata 
cgagagcttgaccgaccctagtaaactggactcggggaaggagctgcacattaatctcat 
tcccaacaagcaagaccgaaccctcactattgtggatactggcattggaatgaccaaggc 
tgacttgatcaataaccttggcactattgccaagtcaggcaccaaagccttcatggaggc 
tttgcaggctggtgcagatatctctatgattggccagtttggtgttggtttttactctgc 
gtatttggttgctgagaaagtgactgtcatcaccaagcataatgatgacgagcagtacgc 
ctgggagtcctcagctggaggatccttcactgtgaggacagacacaggtgaaccaatggg 
tcgtggaacaaaggttatcttgcatctaaaagaagaccaaactgagtatttggaggaaag 
gagaataaaagaaattgtgaagaaacattctcagtttattggctaccccattactctctt 
tgtggagaaggaacgtgacaaggaagtcagtgatgatgaggctgaagaaaaggaagagaa 
agaggaagagaaagaaaaagaagaaaaggagtctgatgacaagcctgaaatagaagatgt 
tggttctgatgaagaagaagaagagaagaaggatggtgacaagaagaaaaagaagaagat 
aaaggaaaagtacattgatcaagaagaactcaacaaaacaaagccgatctggaccagaaa 
tcctgatgacattacgaatgaagaatacggagagttctacaagagcttaaccaacgactg 
ggaagaacatttggcagtaaagcatttttctgttgaaggacaattagaattccgggctct 
tctttttgtcccaagacgcgctccttttgatctatttgaaaacagaaagaaaaagaacaa 
catcaagttgtatgttcgcagagtttttatcatggataactgtgaggagttaatccccga 
gtatctgaatttcatcagaggggtggtggattctgaggatctccctctaaatatttcccg 
tgaaatgctgcaacaaagcaaaattctgaaagttatcaggaagaatttggtcaagaaatg 
cctagaactatttactgaactggctgaagataaagagaactacaaaaagttttatgagca 
gttctcaaaaaatataaagcttggaattcatgaagactctcaaaatcggaagaagctttc 
agagctgttgagatactacacatctgcttctggggatgagatggtttctctgaaggacta 
ctgcaccagaatgaaggaaaaccagaagcacatctattttatcacaggtgagaccaagga 
ccaggttgctaactcagcctttgtggaacgtctccgaaagcatggcttagaagtaatcta 
tatgattgagcccattgatgagtattgtgtgcaacagctgaaggaatttgagggcaagac 
cttggtgtcagttaccaaagaaggactggaacttccagaagatgaagaggaaaagaagaa 
acaggaagagaaaaagacaaaatttgagaacctctgcaaaattatgaaggatattttaga 
gaaaaaggttgaaaaggtggttgtgtcaaaccgattggtgacatccccatgctgtattgt 
cacaagcacatatggctggacagcaaacatggagagaatcatgaaagctcaagccctcag 
agacaactcaacaatgggttacatggcagcaaagaaacacctggagataaaccctgatca 
ctccattattgaaaccttaaggcaaaaggcagaggctgacaagaatgacaagtctgtgaa 
agatctggtcatcttgctgtacgaaacagcactcctgtcttccggcttcagtctggaaga 
tccccagacccatgctaacaggatctacaggatgatcaagcttggtctaggtattgatga 
ggatgatcctactgtggatgataccagtgctgctgtaactgaagaaatgccacccctgga 
aggagatgatgacacatcacgcatggaagaagtagactaggcttcaccagaactatgtgt 
ttgatgcttaccttcattccttctgataatatattttccatgatttttgtttatttttgt 
taacatttaaaacatctgtgtggcatgaaaactaggggaaggtaaaaatttctacatgtg 
atactgtgatactataggtttgactcaagaggttgatagaacgtttgttgtaagacgtaa 
tgtaacctacggtacttgttaactatgggggtctgaaagtgtttagctgttgagctggat 
tcctttagtagaccaaattaagatgacttaagtttcatct 
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> j 0 0 7 1 9 (TOXMARKER Assignment: 7 ; SEQ ID NO: 7) 

ttgctcctccttgctctcctcgtgggcttcttgttactcttagtcaggggacacccaaag 
tcccgtggcaacttcccaccaggacctcgtccccttcccctcttggggaacctcctgcag 
ttggacagagggggcctcctcaattccttcatgcagcttcgagaaaaatatggagatgtg 
ttcacagtacacctgggaccaaggcctgtggtcatgctatgtgggacagacaccataaag 
gaggctctggtgggccaagctgaggatttctctggtcggggaacaatcgctgtgattgag 
ccaatcttcaaggaatatggtgtgatctttgccaatggggaacgctggaaggcccttcgg 
cgattctctctggctaccatgagagactttgggatgggaaagaggagtgtggaagaacgg 
attcaggaggaagcccaatgtttggtggaggaactgcggaaatcccagggagccccactg 
gatcccaccttcctcttccagtgcatcacagccaacatcatctgctccattgtgtttgga 
gagcgctttgactacacagaccgccagttcctgcgcctgttggagctgttctaccggacc 
ttttccctcctaagttcattctccagccaggtgtttgagttcttctctgggttcctgaaa 
tactttcctggtgcccacagacaaatctccaaaaacctccaggaaatcctcgattacatt 
ggccatattgtggagaagcacagggccaccttagacccaagcgctccacgagacttcatc 
gacacttaccttctgcgcatggagaaggagaagtcgaaccaccacacagagttccatcat 
gagaacctcatgatctccctgctctctctcttctttgctggcactgagaccagcagcacc 
acactccgctatggtttcctgctgatgctcaagtacccccatgtcgcagagaaagtccaa 
aaggagattgatcaggtgatcggctcacaccggctaccaacccttgatgaccgcagtaaa 
atgccatacactgatgcagttatccacgagattcagaggttttcagatcttgtccctatt 
ggagtaccacacagagtcaccaaagacaccatgttccgagggtacctgcttcccaagaac 
actgaagtgtaccccatcctgagttcagctctccatgacccacagtactttgaccaccca 
gacagcttcaatcctgaacacttcctggatgccaatggggcactgaaaaagagtgaagct 
ttcatgcccttctccacaggaaagcgcatttgtcttggcgaaggcattgcccgaaatgaa 
ttgttcctcttcttcaccaccatcctccagaacttctctgtgtcaagccatttggctccc 
aaggacattgacctcacgcccaaggagagtggcattggaaaaatacctccaacgtaccag 
atctgcttctcagctcggtgatccggctgaggcagccaggtgccccagttctgttgggaa 
tggcctcatgtttctgcctctgggggacctgctgaaaaccaggctccaaggccactgctc 
cacatct 

> j 0072 0 (TOXMARKER Assignment: 8; SEQ ID NO: 8) 

cccagtgcccttttgtcctgtgtatctgtttcgtggtgtccttgccaacatctatggtgt 
gggtaagggaatgaggagtgaatagccaaagcaggaggcgtgaacatctgaagttgcata 
actgagtgtaggggcagattcagcataaaagatcctgctggagagcatgcactgaagtct 
accgtggttacaccaggaccatggagcccagtatcttgctcctccttgctctccttgtgg 
gcttcttgttactcttagtcaggggacacccaaagtcccgtggcaacttcccaccaggac 
ctcgtccccttcccctcttggggaacctcctgcagttggacagaggaggcctcctcaatt 
ccttcatgcagcttcgcgaaaaatatggagatgtgttcacagtacacctgggaccaaggc 
ctgtggtcatgctatgtgggacagacaccataaaggaggctctggtgggccaagctgagg 
atttctctggtcggggaacaatcgctgtgattgagccaatcttcaaggaatatggtgtga 
tctttgccaatggggaacgctggaaggcccttcggcgattctctctggctaccatgagag 
actttgggatgggaaagaggagtgtggaagaacggattcaggaggaagcccaatgtttgg 
tggaggaactgcggaaatcccagggagccccactggatcccaccttcctcttccagtgca 
tcacagccaacatcatctgctccattgtgtttggagagcgctttgactacacagaccgcc 
agttcctgcgcctgttggagctgttctaccggaccttttccctcctaagttcattctcca 
gccaggtgtttgagttcttctctgggttcctgaaatactttcctggtgcccacagacaaa 
tctccaaaaacctccaggaaatcctcgattacattggccatattgtggagaagcacaggg 
ccaccttagaccccagcgctccacgagacttcatcgacacttaccttctgcgcatggaga 
aggagaagtcgaaccaccacacagagttccatcatgagaacctcatgatctccctgctct 
ctctcttctttgctggcactgagaccggcagcaccacactccgctatggtttcctgctca 
tgctcaagtacccccatgtcacagtgaaagtccaaaaggagattgatcaggtgattggct 
ctcacaggccaccatcccttgatgatcgtaccaaaatgccatacactgatgcagtcatcc 
acgagattcagaggtttgcagatcttgccccaattggtttaccacacagagtcaccaaag 
acaccatgttccgagggtacctgctccccaagaacactgaggtgtatcccatcctgagtt 
cagctctccatgacccacagtactttgaccatccagacaccttcaatcctgagcacttcc 
tggatgccgatgggacactgaaaaagagtgaagcttttatgcccttctccacaggaaagc 
gcatttgtcttggcgaaggcattgcccgaaatgaattgttcctcttcttcaccaccatcc 
tccagaacttctctgtgtcaagccatttggctcccaaggacattgacctcacgcccatgg 
agagtggcattgcaaaaatacctccaacgtaccagatctgcttctcagctcggtgatcgg 
gctgag 
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> j 0072 8 (TOXMARKER Assignment: 9; SEQ ID NO: 9) 

atggagcccagtatcttgctcctccttgctctccttgtgggcttcttgttactcttagtc 
aggggacacccaaagtcccgtggcaacttcccaccaggacctcgtccccttcccctcttg 
gggaacctcctgcagttggacagaggaggcctcctcaattccttcatgcagcttcgcgaa 
aaatatggagatgtgttcacagtacacctgggaccaaggcctgtggtcatgctatgtggg 
acagacaccataaaggaggctctggtgggccaagctgaggatttctctggtcggggaaca 
atcgctgtgattgagccaatcttcaaggaatatggtgtgatctttgccaatggggaacgc 
tggaaggcccttcggcgattctctctggctaccatgagagactttgggatgggaaagagg 
agtgtggaagaacggattcaggaggaagcccaatgtttggtggaggaactgcggaaatcc 
cagggagccccactggatcccaccttcctcttccagtgcatcacagccaacatcatctgc 
tccattgtgtttggagagcgctttgactacacagaccgccagttcctgcgcctgttggag 
ctgttctaccggaccttttccctcctaagttcattctccagccaggtgtttgagttcttc 
tctgggttcctgaaatactttcctggtgcccacagacaaatctccaaaaacctccaggaa 
atcctcgattacattggccatattgtggagaagcacagggccaccttagaccccagcgct 
ccacgagacttcatcgacacttaccttctgcgcatggagaaggagaagtcgaaccaccac 
acagagttccatcatgagaacctcatgatctccctgctctctctcttctttgctggcact 
gagaccggcagcaccacactccgctatggtttcctgctcatgctcaagtacccccatgtc 
acagtgaaagtccaaaaggagattgatcaggtgattggctctcacaggccaccatccctt 
gatgatcgtaccaaaatgccatacactgatgcagtcatccacgagattcagaggtttgca 
gatcttgccccaattggtttaccacacagagtcaccaaagacaccatgttccgagggtac 
ctgctccccaagaacactgaggtgtatcccatcctgagttcagctctccatgacccacag 
tactttgaccatccagacaccttcaatcctgagcacttcctggatgccgatgggacactg 
aaaaagagtgaagcttttatgcccttctccacaggaaagcgcatttgtcttggcgaaggc 
attgcccgaaatgaattgttcctcttcttcaccaccatcctccagaacttctctgtgtca 
agccatttggctcccaaggacattgacctcacgcccatggagagtggcattgcaaaaata 
cctccaacgtaccagatctgcttctcagctcggtga 

> 1 0 0 3 2 0 (TOXMARKER Assignment: 10; SEQ ID NO: 10) 

atggagcccagtatcttgctcctccttgctctccttgtgggcttcttgttactcttagtc 
aggggacacccaaagtcccgtggcaacttcccaccaggacctcgtccccttcccctcttg 
gggaacctcctgcagttggacagagggggcctcctcaattccttcatgcagcttcgagaa 
aaatatggagatgtgttcacagtacacctgggaccaaggcctgtggtcatgctatgtggg 
acagacaccataaaggaggctctggtgggccaacctgaggatttctctggtcggggaaca 
atcgctgtgattgagccaatcttcaaggaatatggtgtgatctttgccaatggggaacgc 
tggaaggcccttcggcgattctctctggctaccatgagagactttgggatgggaaagagg 
agtgtggaagaacggattcaggaggaagcccaatgtttggtggaggaactgcggaaatcc 
cagggagccccactggatcccaccttcctcttccagtgcatcacagccaacatcatctgc 
tccattgtgtttggagagcgctttgactacacagaccgccagttcctgcgcctgttggag 
ctgttctaccggaggttttccctcctaagttcattctccagccaggtgtttgagttcttc 
tctgggttcctgaaatactttcctggtgcccacagacaaatctccaaaaacctccaggaa 
atcctcgattacattggccatattgtggagaagcacagggccaccttagacccaagcgct 
ccacgagacttcatcgacacttaccttctgcgcatggagaaggagaagtcgaaccaccac 
acagagttccatcatgagaacctcatgatctccctgctctctctcttctttgctggcact 
gagaccagcagcaccacactccgctatggtttcctgctgatgctcaagtacccccatgtc 
gcagagaaagtccaaaaggaggttgatcaggtgatcggttcacaccggctaccaaccctt 
gatgaccgcagtaaaatgccatacactgatgcagttatccatgagattcataggttttca 
gatcttgtccctattggagtaccacacagagtcaccaaagacaccatgttccgagggtac 
ctgcttcccaagaacactgaagtgtaccccatccggagttcagctctccatgacccacag 
tactttgaccacccagacagcttcaatcctgaacacttcctggacgttaacggggcactg 
aaaaagagtgaagctttcatgcccttctccacaggaaagcacatttgtcttggcgaaggc 
attgcccgaaatgaattgttcctcttcttcaccaccatcctccagaacttctctgtgtca 
agccatttggctcccaaggacattgacctcacgcccaaggagagtggcattggaaaaata 
cctccaacgtaccagatctgcttctcagctcggtga 

>ml 12 5 1 (TOXMARKER Assignment: 1 1 ; SEQ ID NO: 1 1) 

cccagtgcccttttgtcctgtgtatctgtttcgtggtgtccttgccaacatgtatggtgt 
gggtaagggaatgaggagtgaatagctaaagcaggaggcgtgaacatctgaagttgcata 
actgagtggaggggcggattcagcataaaagatcctgctggagagcatgcactgaagtct 
accgtggttacaccaggaccatggagcccagtatcttgctcctccttgctctccttgtgg 
gcttcttgttactcttagtcaggggacacccaaagtcccgtggcaacttcccaccaggac 
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ctcgtccccttcccctcttggggaacctcctgcagttggacagagggggcctcctcaatt 
ccttcatgcagcttcgagaaaaatatggagatgtgttcacagtacacctgggaccaaggc 
ctgtggtcatgctatgtgggacagacaccataaaggaggctctggtgggccaacctgagg . 
atttctctggtcggggaacaatcgctgtgattgagccaatcttcaaggaatatggtgtga 
5 tctttgccaatggggaacgctggaaggcccttcggcgattctctctggctaccatgagag 
actttgggatgggaaagaggagtgtggaagaacggattcaggaggaagcccaatgtttgg 
tggaggaactgcggaaatcccagggagccccactggatcccaccttcctcttccagtgca 
tcacagccaacatcatctgctccattgtgtttggagagcgctttgactacacagaccgcc 
agttcctgcgcctgttggagctgttctaccggaggttttccctcctaagttcattctcca 

10 gccaggtgtttgagttcttctctgggttcctgaaatactttcctggtgcccacagacaaa 
tctccaaaaacctccaggaaatcctcgattacattggccatattgtggagaagcacaggg 
ccaccttagacccaagcgctccacgagacttcatcgacacttaccttctgcgcatggaga 
aggagaagtcgaaccaccacacagagttccatcatgagaacctcatgatctccctgctct 
ctctcttctttgctggcactgagaccagcagcaccacactccgctatggtttcctgctga 

15 tgctcaagtacccccatgtcgcagagaaagtccaaaaggaggttgatcaggtgatcggtt 
cacaccggctaccaacccttgatgaccgcagtaaaatgccatacactgatgcagttatcc 
atgagattcataggttttcagatcttgtccctattggagtaccacacagagtcaccaaag 
acaccatgttccgagggtacctgcttcccaagaacactgaagtgtaccccatccggagtt 
cagctctccatgacccacagtactttgaccacccagacagcttcaatcctgaacacttcc 

20 tggacgttaacggggcactgaaaaagagtgaagctttcatgcccttctccacaggaaagc 
acatttgtcttggcgaaggcattgcccgaaatgaattgttcctcttcttcaccaccatcc 
tccagaacttctctgtgtcaagccatttggctcccaaggacattgacctcacgcccaagg 
agagtggcattggaaaaatacctccaacgtaccagatctgcttctcagctcggtgatccg 
gctgaggcagccatgtgccccagttctgttgggaatggcctcatgtttctgcctctgggg 

25 gacctgctgaaaaccaggct 

>m26125 (TOXMARKER Assignment: 12;SEQIDNO: 12) 

gacttgggaggaaccagggcctacacttagccctggtaaacagcagagcatgctgggata 
attcttcccagaaaaggaaaagcaggcacttctgttcccagggaaaacaacaggagcact 

30 ttggacctccctgctgcagtcaggagtcatgtggctggaacttgtcctggcttcccttct 
gggctttgtcatctactggtttgtctcccgggacaaggaggaaaccttaccactaggaga 
tggatggtgggggccagggtcaaagccatcagccaaagaagatgagagcatccggccctt 
caaggtggaaacatcagatgaggagatcaaggacttacaccagaggatagataggttccg 
ggcatccccacctttggagggcagccgcttccactatggcttcaactccaactacatgaa 

35 gaaagtggtgtcctactggaggaacgagtttgactggaggaagcaggtggagatcctcaa 
ccagtaccctcacttcaagaccaagatcgaagggcttgacatccacttcatccatgtgaa 
gcctccccagctgccctcagggcgcaccccaaagcccttgctgatggtgcatggctggcc 
tggatccttctatgagttttacaagatcatcccactactgactgaccccaagtcccacgg 
tctgagtgacgagcacgtgtttgaagtcatctgtccctcgattcctggctatggctactc 

40 agaggcatccagcaagaaaggtttaaattcggtggccactgcgaggattttctacaagct 
gatgacacggctgggcttccagaaattctacattcaaggcggggactgggggtccctcat 
ctgcaccaacatggcccagatggttcccaaccacgtgaaaggcctgcacttaaatatggc 
tttcatttcgagaagtttttacaccatgactcctctcctgggccaacgcttcgggagatt 
ccttggctacacagagaaggatatcgagctcttgtacccctataaggagaaggttttcta 

45 cagcatcatgagggagagtggctacttacacatccaagccaccaagccagacactgtggg 
ctgtgctctcaatgactctcccgtgggcctggctgcctacatcttagagaagttctccac 
ctggaccaagtcagagtaccgtgaactggaggatggaggcctggagaggaagttctccct 
ggatgatctgctggttaacatcatgatctactggacgacaggaaccattgtctcctccca 
acgctactacaaggagaatttgggccagggcatcatggtccataaacatgaggggatgaa 

50 ggtctttgtgcccactggcttttcagccttcccttccgagctactgcatgccccagaaaa 
gtgggtgaaggtcaagtaccccaaactcatctcctattcctacatggaacgtgggggcca 
ctttgctgcctttgaagagcccaagcttctggcccaggacatccgcaagttcgtgtccct 
ggctgagctgcagtagtgacactggataccaactgtggctttagcagcagccctggttcc 
tcccaagtcacacttatggaagatgacccctttctgaggaataagtttgttccctgacca 

55 cactcgaggacccagacttaaactccacagagtcgtatgttacccccatatgcttcacct 
cactacatagctgtgttagctacatggctttaatgataaatggatttatttct 

>m34452 (TOXMARKER Assignment: 13; SEQ ID NO: 13) 

tgagccaatcttcaaggaatatggtgtgttctttgccaatggggaacgctggaaggccct 
60 tcggcgattctctctggctaccatgagagactttgggatgggaaagaggagtgtggaaga 
acggattcaggaggaagcccaatgtttggtggaggaactgcggaaatcccagggagcccc 
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actggatcccaccttcctcttccagtgcatcacagccaacatcatctgctccattgtgtt 
tggagagcgctttgactacacagaccgccagttcctgcgcctgttggagctgttctaccg 
gaccttttccctcctaagttcattctccagccaggtgtttgagttcttctctgggttcct 
gaaatactttcctggtgcccacagacaaatctccaaaaacctccaggaaatcctcgatta 
cattggccatattgtggagaagcacagggccaccttagaccccagcgctccacgagactt 
catcgacacttaccttctgcgcatggagaaagtgagtcctgcatggatgagagaggagaa 
gtcgaaccaccacacagagttccatcatgagaacctcatgatctccctgctctctctctt 
ctttgctggcactgagaccggcagcaccacactccgctatggtttcctgctcatgctcaa 
gtacccccatgtcacagagaaagtccaaaaggagattgatcaggtgattggctctcacag 
gccaccatcccttgatgatcgtaccaaaatgccatacactgatgcagtcatccacgagat 
tcagagatttgcagatcttgccccaattggtttaccacacagagtcaccaaagacaccat 
gttccgagggtacctgctccccaagaacactgaggtgtatcccatcctgagttcagctct 
ccatgacccacagtactttgaccatccagacaccttcaatcctgagcacttcctggatgc 
cgatgggacactgaaaaagagtgaagcttttatgcccttctccacaggaaagcgcatttg 
tcttggcgaaggcattgcccgaaatgaattgttcctcttcttcaccaccatcctccagaa 
cttctctgtgtcaagccatttggctcccaaggacattgacctcacgcccaaggagagtgg 
cattgcaaaaatacctccaacataccagatctgcttctcagctcggtgatcgggctgagg 
cagccaggtgccccagttctgttgggaatggcctcatgtttctgcctctgggggacctgc 
tgaaaaccaggctcaaggccactgctcacatcttcctattgcagttctccaaagtcccaa 
ggcttgttcttattcctgtgaatggcactgaagaagtcaatcgactgtcttattttgaca 
tgtgaacagagatttcatgagtacacatctcatgctgagtcacttccctcttcctcctaa 
tagcccacgtccccacttatcagccctccatggtctgtgatctgtgctaatggactctgt 
atatggtctcagtgctatgtctacagacttacatagtatgtatggttcaggtaaacagaa 
tcacagagtgtgtg 

>u33546 (TOXMARKER Assignment: 14;SEQIDNO: 14) 

atggaacctagtgtcctacttctccttgctgtcctcctcagcttcttgctactcctggtc 
aggggccatgcaaagatccatggtcgtcttccaccaggaccctgccctgtaccccttttg 
ggaaatctcttgcagatggacagaagaggcctcctcaagtcttttattcagcttcaagaa 
aaatatggagatgtgttcacagtgcacttaggactgaggccagtggtcgtgttatgtggg 
acacagaccataagagaggctctggtggaccatgctgaggctttctctggccgggggaca 
attgctgggcttgagccagttttccaggactatggtatattcttttccagtggagaacag 
tggaagacccttcgacgattctctatggccaccatgagagactttgggatgagaaagaag 
agtgtggaggagagaataaaggaagaatcccaatgtttggtggaggaactgaagaaatac 
cagggagcccccctggatcccaccttccttttccagtgcatcacatccaacatcatctgc 
tccattgtctttggagagtgctttgactacacagatcaccaattcctgcacctgctggat 
ctgatgtatcagacgttttcactcttaagctcaatcttcagtcaggtatttgaactcttc 
cctggtgtcctgaagtactttcctggtgcccacagacaaatctccagaaacctccatgaa 
atcctggacttcattggccagagtgtggagaagcacagggccactttggacccaaatgct 
c.cacgagactttatatatacttaccttctgcacatggagaaaaagtcaaaccattataca 
gagttccatcactggaacctactgtcgtctgtactctctctcttctttgctggcactgag 
actagcagcaccacactccgctatggcttcctgatcatgctcaagtaccctcatatcaca 
gagaaagtccaaaaagagattgattgtgtgattggctcacaccggctacctaccctggat 
gaccgcagcaaaatgccatacaccgaggcagttatccatgagattcagagattttcagat 
cttgcccctattggaacaccacacagagtcatcaaagacaccattttccgagggtacctg 
ctccctaagaacactgaggtgttccccatcctgagttcagttctccatgatccacagtac 
tttgaacaaccagacatcttcaatcttcagcactttctggatgccaatggggcactgaag 
ataattgaagcttttctgcccttctccacaggaaagcgaatttgtcttggtgaaagcatt 
gcccgcaatgaattgttccttttcttcactaccatcctccagaacttctccgtgtccagc 
cctgtggctcctaaagacattgatctcactcccaaagagagtggtattggaagaataccc 
caagtgtaccagatctgcttcttggcccactga 

>x74673 (TOXMARKER Assignment: 15; SEQ ID NO: 15) 

gaattccgcggccgccaacgtcctctcttacccgccaccttcttctgccacctctaccac 
ggtcaccatgtcgcaagcccggcctgccactgtgctgggtgccatggagatgggtcgccg 
catggatgtgacctccagctccgcgtcggtgcgcgccttcctgcagcgcggccacacgga 
gatagacaccgccttcgtgtatgcgaacggtcagtctgagaccatcctaggagacctggg 
gctcggactgggccgcagcggctgcaaagtaaaaattgccaccaaggctgccccaatgtt 
tgggaagacactgaagccagccgatgttcggttccagctggagacgtcactgaagaggct 
gcagtgtccccgggtggacctcttctatttacactttccagaccacggcactcctataga 
ggagaccctgcaggcctgccaccacgtgcatcaggagggcaagtttgtggagcttggtct 
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gtccaactatgtctcctgggaagtggctgagatttgtaccctctgcaagaaaaatggctg 
gatcatgccaactgtgtaccagggcatgtacaacgccatcaccaggcaggtggagactga 
gctcttcccctgcctcagacacttcggactaaggttctacgccttcaaccctttggctgg 
gggcctgctgactggcagatataaataccaggataaggatgggaagaatcctgagagccg 
5 cttctttgggaatccattttctcaactgtacatggaccgctactggaaggaggaacactt 
caatggcatcgccttggtggagaaggctctgaagactacctatggccccactgcccccag 
tatgatctcagctgccgtacggtggatgtaccatcactcacagctcaagggcacccaagg 
ggatgcagtcattctgggcatgtccagtctggaacaactggagcagaacttggccttggt 
cgaggaagggcctctggagccagctgttgtggatgcctttgaccaagcctggaacctagt 
10 tgcccacgagtgtcccaactatttccgctaagatacatctgccttggggatggcgcagct 
tactgcctgccccgccttgtcctgggctcgatctgatctggttctttcctttttagacag 
gtcactgtctttttcttccctgctttctatacagccagttgctttcaaagtgagagctgg 
ctgagccccaatacctcctgctgaataaaactgttccctgtcacagcctgggctacaact 
ggcggccga 

15 

>scr_gb-xl3044_4 (TOXMARKER Assignment: 16; SEQ ID NO: 16) 
ttttttttttttttttttctaccttctaccttttattgtcacgtgaaccatggtcctaca 
ggctgctgacaagcttggctgagcagggatcccaggggcgtcggcaggacatgaggaagg 
gttgctgggagggcttggcctcttccttgagaagacagcaaatgtatccagcctagatta 

20 agggtagggcatcccctatccctgtcagtgggcctagatctcagagccccacattaaaga 
ctgctaatgggtcagaaatgggggtcccttagatgggggtaggcagcaaggccctccctc 
cagtgttctcattctgttccggtttcatttgttgtgtccagggacggtgaagcagatacc 
agtctcaagccccagggtgcaggaagacgggaaatggggtgtgatgttagggagtgtaag 
aagggctgaggagcaggggagctgccgccgtgcagagctggcttctgtcttcacaagaac 

25 atttggcccatatcctgcttggtcactcccaggccagaagatgggtcttccatgtccagt 
ggctctttaggtggagtctgggtgggctgcttctcctccagggagttcttgctcatttca 
aacaacagccactgtttcatccagctctcaaagaccttccagtccagaccattcatagag 
ttcttaaggtgcttcagattctccgggaagctccccttcagctgtgggtagttcacgggt 
ccagacttcgtaagcaggtgcatcacgtggtcctgggtcatgttgccatacttggtaaca 

30 ttcttcacgggcgcttggagcatgttatccatggacagtgggcgcatcagcaagggagta 
gccatgcgcatcgggctcacaggtttggcagatttcggaagcttcatgcgaaggttctcc 
agttgcaggttctgggaggtgacggtcagcttgtccaggcggccctgctgctggtacagg 
aagtaagcagtggtggcctgcccagccaagagcagagccaccaggacagagacactggtg 
tacaggactccacggttgcaattgctttctggggctctagcacgctggcccaggatgggc 

35 agctgctcatggttagagatgaggtcgcgctggtcatccatgactctagcctctagcttt 
tcccccaagtgctgctggtgctgctgctgctgctgct 

>scr_gb-xl4254__5 (TOXMARKER Assignment: 17; SEQ ID NO: 17) 
tttttttttttttttgttctaccttctaccttttattgtcacgtgaaccatggtcctaca 

40 ggctgctgacaagcttggctgagcagggatcccaggggcgtcggcaggacatgaggaagg 
gttgctgggagggcttggcctcttccttgagaagacagcaaatgtatccagcctagatta 
agggtagggcatcccctatccctgtcagtgggcctagatctcagagccccacattaaaga 
ctgctaatgggtcagaaatgggggtcccttagatgggggtaggcagcaaggccctccctc 
cagtgttctcattctgttccggtttcatttgttgtgtccagggacggtgaagcagatacc 

45 agtctcaagccccagggtgcaggaagacgggaaatggggtgtgatgttagggagtgtaag 
aagggctgaggagcaggggagctgccgccgtgcagagctggcttctgtcttcacaagaac 
atttggcccatatcctgcttggtcactcccaggccagaagatgggtcttccatgtccagt 
ggctcactgcagttatggcgcccgcggctcttggtgtgagggacctcagtgccgttgggg 
aacacacaccagcagtagccagtgctcccatggcactggagtggcatatagttaccgttc 

50 teat cacacttgggacggaacgcccccgggtggacatcagggatgtggctgacttct tec 

tggcacttggtcaatactttaggtggagtctgggtgggctgcttctcctccagggagttc 
ttgctcatttcaaacaacagccactgtttcatccagctctcaaagaccttccagtccaga 
ccattcatagagttcttaaggtgcttcagattctccgggaagctccccttcagctgtggg 
tagttcacgggtccagacttcgtaagcaggtgcatcacgtggtcctgggtcatgttgcca 

55 tacttggtaacattcttcacgggcgcttggagcatgttatccatggacagtgggcgcatc 
agcaagggagtagccatgcgcatcgggctcacaggtttggcagatttcggaagcttcatg 
cgaaggttctccagttgcaggttctgggaggtgacggtcagcttgtccaggcggccctgc 
tgctggtacaggaagtaagcagtggtggcctgcccagccaagagcagagccaccaggaca 
gagacactggtgtacaggactccacggttgcaattgctttctggggctctagcacgctgg 

60 cccaggatgggcagctgctcatggttagagatgaggtcgcgctggtcatccatgactcta 
gcctctagcttttcccccaagtgctgctggtgctgctgctgctgctgctgctg 
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>scr_gb-bi275638__l (TOXMARKER Assignment: 18;SEQIDNO: 18) 

cggcacgaggcgcgctcggcgctgtcagttcgtcccgctgcccctcggcccttgctgctg 

gctctgacggcgaccgacggcgggcggggcccgggttcgcggccgagcggcgccggtgag 

ggcgcggaggaggcgcacagcgggaggaggagccgtgagcctggcacggagcggccgcgg 

ccatggcgtacgcctatctcttcaagtacatcatcatcggcgacacaggtgttggtaaat 

cgtgcttattgctacagtttacagacaagaggtttcagccggtgcatgacctcacaattg 

gtgtagagtttggtgctcgaatgataaccattgatgggaaacagataaaactccagatct 

gggatacagcagggcaggagtcctttcgttctatcacaaggtcatattacagaggtgcag 

cgggggctttactagtgtatgatattacaaggagagacacgttcaaccacttgacaacct 

ggttagaagacgcccgtcagcattccaattccaacatggtcatcatgcttattggaaata 

aaagtgacttagaatctaggagagaagtgaaaaaggaagaaggtgaagcttttgcacgag 

agcatggacttatcttcatggaaacttctgccaagactgcttctaatgtagaggaggcat 

ttattaacacagcaaaagaaatttatgaaaaaatccaagaaggggtctttgacattaata 

atgaggcaaacggcatcaaaattggccctcagcatgctgctaccaatgcatctcacggag 

gcaaccaaggagggcagcaggcagggggaggctgctgctgagtctgctgttgccggctag 

ctgcccagtggagccacgcactctgtcaccctctctcctcatgctcagctgagacatgaa 

actattgaaatggctttgtgtcacaggagactttaatecttcagattcttgtataacttt 

gaataaatggttaatgttcacttaaaaagacagattttggagattgtattcatatctatt 

tgcatttgatttctaggtcaattg 

>scr_gb-x66871_3 (TOXMARKER Assignment: 19; SEQ ID NO: 19) 

tttttttttttttttttttttttttttttttttttttttttttttttttttttttttcta 

aagtaaaaatggtttattcacgacacatatgaggaagtgtctcatgtcacagacggtacg 

tccaactccctggaatgttcatttctttggcataaaggagagaatgaggggaaagccagg 

caaaggcagctaagatgggggatgggtcggcagctctgtcgtcatcttcacagggaggag 

ttcaggggtccattagtggcaggctgattctctagaacattaggttggggcacaggtagg 

gccacttctgggcaatccaccatgccaagcccttcagtcgtccccaccacacaggtacag 

cagcgccttctggtagtcacccttagtgtcttgctggatgaagtagtacagggatttgcc 

atatttcctcttgaattcagatctgattttcaacatgtccacttcactgcgagagaccat 

gattctaatcaggaccttgtctcgagtccccttgcccttcatggagtcatacagccggtc 

agcaaagtacaggggcttgttctgaatgcactgaaccaggttcaggaaggcgttctccag 

gtctcctttgacctctttcctgatgctctccagcatgtcataaggactgtagctcttgta 

cctttcgaacactttctggaggtggcacacactgcgctcagtcatgatgctgatccactt 

ggggacatcggttcctttcctcttcaccccagcatcatagagctcccgggcatcctggtc 

aatcagctcgtagtcaataacagaaccatcctctgcccgtttaccctttgcaagggcgac 

caacagctttcggaattctccagatgtgtcagagatgatgtccttctccagatcggtctt 

gtacatttccttatacactcggttaatctcctgcagctcctggttggttcttgagcagat 

gatctcgatgagggagtcctcatcagtccccaggcccttcatggaggctttgagctcaga 

ggcatcgtactgagcaggtgtcttcaacaggcctaacatcacggtctccaggtgaccaga 

caaggccgacttcatcgccgatggcagttcctttttggtcctcctctggtaggcgaaggc 

aatgtcctgcctctgtgcattgctgcggttagtcagaatgttgacaatggtgacctcgtc 

cacgcctttggtcttgattgctgtttcaatgttcaaagcatccctctcagcgtcgaagtt 

ggtgtagggtttgaccgacccataggcacttgggggtgtagaatgctgagaatcaccctc 

caagctgagcttgcacaggatttcgtggacagtagacattttgaaaaaaaagctgggccg 

ggcacctattgcagagagcctcc 

>scr_gb- 14 93 7 9_3 (TOXMARKER Assignment: 20; SEQ ID NO: 20) 

gggatgacatagagtacaacattcagagaagttaactattaagtcgtcaggatgaaaggt 

caggaggcaggcctttaactgggctgtgagaatggagaaagcacggtgcactttaacatc 

tgctttcccagaggaaaaagtaaaggagaaacagtacaatcatagaagagtcttcgtaac 

agaagcgcgaggagagcattatggacaagttctgcaactctactttttgggatctctcat 

tactggaaagtccagaggctgacctgcctctttgttttgagcaaactgttctggtgtgga 

ttcccttgggctttctttggctcctggctccttggcaactttacagcgtgtacagatcca 

ggaccaagagatcttctataaccaaattctaccttgccaagcaggtgttcgtcgtgtttc 

ttcttattttagcagccatagacctgtctcttgcgctcacagaagatactggacaagcca 

cagttcctcctgtcagatatacgaatccaatcctctacctgtgcacatggctcctggttt 

tggcagtccagcacagcaggcaatggtgtgtacgaaagaactcttggttcctgtctctgt 

tctggatcctctcggtcttatgcggcgtattccagtttcagactctgatacgagcactcc 

tgaaggacagcaagtccaacatggcctactcctacctgttcttcgtctcctacggtttcc 
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agattgtcctcctgattcttacagccttttcaggaccaagtgactcaacacaaactccat 
cagtcacggcttcctttctgagtagcattacatttagttggtatgacaggactgttctga 
aaggttacaagcatccactgacactagaagatgtctgggatatcgatgaagggtttaaaa 
caaggtcagtcaccagcaagtttgaggcggccatgacaaaggacctgcagaaagccaggc 
5 aggcttttcagaggcggctgcagaagtcccagcggaaacctgaggccacactacacggac 
tgaacaagaagcagagtcagagccaagacgttctcgtcctggaagaagcgaaaaagaagt 
ctgagaagaccaccaaagactatcccaaatcgtggttgatcaagtctctcttcaaaacct 
tccacgtagtgatcctgaaatcatttatactgaaattaatacatgaccttttggtgtttc 
tgaatcctcagctgctgaagttgctgatcggtttcgtgaagagctctaactcatacgtgt 

10 ggtttggctatatctgtgcaatcctaatgtttgctgtgactctcatccaatctttctgcc 
ttcagtcttactttcaacattgttttgtgttgggaatgtgcgtacggacaaccgtcatgt 
cttcgatatataagaaggcattgaccctatctaacttggctaggaagcagtacaccattg 
gagagacggtgaacttgatgtctgtagattcccagaagctaatggatgcgaccaactaca 
tgcagttggtgtggtcaagtgttatacagattactttgtccatcttcttcctgtggagag 

15 agttgggaccgtccatcttagcaggtgttggggttatggttctcctaatcccagttaatg 
gagttctggctaccaagatcagaaatattcaggtccaaaatatgaagaataaagacaaac 
gtttaaaaatcatgaatgagattctcagtggaatcaagatcctgaaatactttgcctggg 
agccttcatttcaagagcaagtccagggcattcggaagaaagaactcaagaacttgctgc 
ggttcggccagctgcagagtctgctgatcttcattttacagataactccaatcctggtgt 

20 ctgtggtcacattttctgtctatgtcctggtggatagcgccaatgttttgaatgcggaga 
aggcatttacctccatcaccctcttcaatatcctacgcttccctctgtccatgcttccca 
tggtgacctcatcgatcctccaggccagtgtttctgtggaccggctggagaggtatttgg 
gaggagacgatttagacacatctgccattcgccgcgtcagcaattttgataaagctgtga 
agttttcagaggcctcttttacttgggacccggacttggaagccacaatccaagatgtga 

25 acctggacataaagccaggccaactggtggctgtggtgggcactgtaggctctgggaaat 
cctctttggtatcagccatgctgggagaaatggaaaacgttcacgggcacatcaccatcc 
agggatccacagcctatgtccctcagcagtcctggattcagaatggaaccatcaaagaca 
acatcctgtttgggtccgaatacaatgaaaagaagtaccagcaagttctcaaagcatgcg 
ctctcctcccagacttggaaatattgcctggaggagacatggctgagatcggagagaagg 

30 ggataaatctcagtggtggtcagaagcagcgagtcagcctggccagagctgcctatcaag 
atgctgacatctatattctggacgatcccctgtcggctgtggatgctcatgtgggaaaac 
acattttcaacaaggttgtgggccccaacggcctgttggctggcaagacgagaatctttg 
ttactcatggtattcacttccttccccaagtggatgagattgtagttctggggaaaggca 
ccatcttagagaaaggatcctatcgtgacctgttggacaagaagggagtgtttgctagga 

35 actggaagaccttcatgaagcattcagggcctgaaggagaggccacagtcaataatgaca 
gtgaggcggaagacgacgatgatgggctgattcccaccatggaggaaatccctgaggatg 
cagcttccttggccatgagaagagaaaatagtcttcgccgtacactgagccgcagctcta 
ggtccagcagccgacgtgggaagtccctcaaaaactccttgaagattaaaaatgtgaatg 
tcttgaaggagaaggaaaaagaagtggaaggacaaaaactaattaagaaagaatttgtgg 

40 aaaccgggaaggtcaagttctccatctacctgaagtatctacaggcagtagggtggtggt 
ccatacttttcatcatccttttctacggattgaataatgttgcttttatcggctctaacc 
tctggctgagtgcttggaccagtgactctgacaacttgaatgggaccaacaattcgtctt 
ctcatagggacatgagaattggggtctttggagctctgggattagcacaaggtatatgtt 
tgcttatttcaactctgtggagcatatatgcttgcagaaatgcatcaaaagctttgcacg 

45 ggcagctgttaaccaacatcctccgggcacccatgaggttttttgacacaactcccacag 
gccggattgtgaacagattttctggtgatatttctactgtggacgacttgctcccccaga 
cacttcgaagctggatgatgtgtttctttggcatcgctggcactcttgtcatgatctgca 
tggccaccccagtcttcgctatcatcatcattcctctcagcattctttatatttcggtgc 
aggttttttatgtggctacttcccgccagctgagacggttggattctgtcaccaaatctc 

50 cgatctattctcacttcagtgagactgtcacaggtttgcccattatccgtgcctttgagc 
accagcagcgatttctagcttggaatgagaagcagattgacatcaaccagaaatgtgtct 
tttcctggattacctccaacaggtggcttgcaattcggctggagctggttggaaacttgg 
tcgtcttctgttccgccttgctgctggttatttatagaaaaaccttaaccggggacgttg 
tgggctttgttctgtccaacgccctcaatatcacacaaaccttgaactggctagtgagga 

55 tgacgtcagaagcagagaccaacattgtggcagttgagcgaataagtgaatacataaatg 
tagagaatgaggcgccctgggtgactgacaagaggcctccggcagactggcccagacatg 
gtgagatccagtttaacaactatcaagtgcggtatcggccggagctggatctggtactga 
aagggatcacttgtaacatcaagagcggagagaaggtcggcgtagtgggcaggactgggg 
ctgggaaatcatccctcacaaactgcctcttcagaatcttagagtctgcggggggccaga 

60 tcatcattgatgggatagatgttgcctccattggactgcacgaccttcgagagaggctga 
ccatcattccccaggaccccattttgttctcggggagtctgaggatgaatctcgaccctt 
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tcaacaaatattcagatgaggaggtttggagggccctggagttggctcacctcagatcct 
ttgtgtctggcctacagcttgggttgttatccgaagtgacagagggtggtgacaacctga 
gcatagggcagaggcagctcctatgcctgggcagggctgtgcttcgaaaatccaaaatcc 
tggtcctggatgaagccacggctgcagtggatctcgagacggatagcctcattcagacga 
5 ccatccgaaaggagttctcccagtgcacggtcatcaccatcgctcacaggctgcacacca 
tcatggacagtgacaagataatggtcctagacaacgggaagattgtcgagtatggcagtc 
ctgaagaactgctgtccaacagaggttccttctatctgatggccaaggaagccggcattg 
aaaatgtgaatcacacagagctctagcagctggttccgtggctggcggactataagaaca 
gtttctattatttgctttggtttctgtgactgtgctctaggtgcaaagacacatattttg 
10 ttcccgttgctcaggctggcctcaaactctaaggctccagcaatctctggtctcagccag 
agacctgtaaaaatagacacttcaaagattatcatgaataaatatttaaataaatagtaa 
aaaaaaaaaaaaaaaaaaaa 

>scr_sc- 132690501^1 (TOXMARKER Assignment: 21; SEQ ID NO: 21) 
15 gaattctctgggcccatccgttgttctcaatggacatgacctccaggaagctaaagtcca 
ggtcgtgaccaaagccaaggttgtagagcgggaatctgccccggatagcgttgcggacat 
tcttgaggatctgggaacggtccgtctccccttcagtgggctctccgtcggtcaacatga 
taagaattgaggcagggctgctgagttctgggtggcttccttgagctctgtttaagatct 
cgattcctcggagcaagcctccattcaggtttgtggctccagccaaagaaaagcgcctca 
20 caaagtcttgggctgcttgcaaattggcgtgagacgcgggtaccagtgagcccttccatg 
actgcacttgagacccaaagaggaccaggtcaaagttgtctactggcttcatgtccccca 
atatcttaaggagcgcctcctttgtctgcttcactttctggccttccatggacccactga 
tatcaatcacaaaaaccaggttcttgctcatgttggtcaggttttggggggcaaagaaat 
gtgtaaagtaattgttggccaccaggaggtcacagagcttgtctcggttcacatcgtagg 
25 tcaccttgaagtctccattcagcaaggaggtagagcacgtggggcaggactgctgctggc 
tcacagtggggcggaagagcacatgacccttcttccccgagaaagacttcttgatggttt 
gagcacttgactggtgatgacgtagtgggcaaagcgagaggtgactttgcaattg 

>scr_gb-awl41735_3 (TOXMARKER Assignment: 22; SEQ ID NO: 22) 

30 ttttttttttttttttttactgtatatgtaatttaattcaaattggaacaatgacgtaga 
tatataagccacaatccatgaaagtcttggaggaaaacataggagcagttatttctgtac 
ttgattttagtggtgagattcttagctgtggcatggatacacatgatcagaacagtatta 
aataaggagaacgtcactgaaaagagcaatctgtgtgcatcaaagaacattatcaagaaa 
gcaaagaagcaatgtgtataaaacgtccctaataggtaaatctacatagataaagagaag 

35 attggtggttagacaaccagagggaggaagaatggagagtcactgagtaatggttacagt 
gtgtttgaaaggggataaagataagatcgtggcctgattttacccataaattgttgattc 
tttacacaagaataatggttagaggaatgagccacaatagcagatattatccaaccatta 
atgaaacttatgaccacttcttaaatttttatttatttttttaaaatttacttgtttctg 
cataactttgagtgatgttacatgcttatacaggatgctggggccagtagtagccaaata 

40 aaggcatcaagacatgggtggaaactggaatttccagaggttgtaagcagccatgtgggt 
ggtgggaaatgtccctgtgtcctttgcaagatcagcaacttttcctagtatctgtccttc 
tctccagcattcttacacattgattcagttctaccaggctgtaagttattggctataagt 
tatgagtatcagcggcatagcaaaggctatatggcatcattagacataacctgcaaaagg 
gcacaaatgcattcaggatagggagagctgaatgcaggcatcataagatcaggctggcag 

45 gaagaaagtatcctcatcttggaacatggtttccccctacttgcccatcctgacagagct 
ttggagtggtggagatactgaagagaggactctccccatgtagtaaatgtgtctttatgg 
agatgagaacctgccacagaacagaatgctgctggttttgttgtgcttgatgaagaaaag 
gaaggggtggtcagcacagaatgttgggacaaaagcagcacagcagtattctatgacagc 
ggaggctgctgcagcctctgtgccttcctcattgacctccactacgctcttgtgaacaat 

50 cttggacacacacaggtttctctctggagacattgctgataagtcagccttggcctcttg 
gaagacatccactattcccaagcgctgaaacacagactccatgtcataatcctcttgcag 
tttaaattttggaaggaaaacctcaacattagtgttcttcataaagtctgggttggtcca 
ggctgttaacttctcaaaagtgagattgctttccaccttgctgaggtccccgtcattatc 
tgggagtaggaccacgaagctcagctccattccttcatatggcatcatgagcacttgcgc 

55 ctgcacctcgttcacatgggcaaggttatatgtgtcctcacaacacatcatctgcactag 
t 

>af 184983 (TOXMARKER Assignment: 23; SEQ ID NO: 23) 
gtatttcataaaacagagaggatcgcaggaggccggcactctgactcctggtggatggga 
60 ctagggagtcagagtcaagccctgactggctgagggcgggcgctccgagtcagcatggaa 
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agtctctgcggggtcctggtatttctgctgctggctgcaggactgccgctccaggcggcc 
aagcggttccgtgatgtgctgggccatgagcagtatccggatcacatgagggagaacaac 
caattacgtggctggtcttcagatgaaaatgaatgggatgaacagctgtatccagtgtgg 
aggaggggagagggcagatggaaggactcctgggaaggaggccgtgtgcaggcagcccta 
accagtgattcaccggccttggtgggttccaatatcaccttcgtagtgaacctggtgttc 
cccagatgccagaaggaagatgccaacggcaatatcgtctatgagaggaactgcagaagt 
gatttggagctggcttctgacccgtatgtctacaactggaccacaggggcagacgatgag 
gactgggaagacaacaccagccaaggccagcacctcaggttccccgacgggaagcccttc 
cctcgcccccacggacggaagaaatggaacttcgtctacgtcttccacacacttggtcag 
tattttcaaaagctgggtcagtgttcagcacgagtttctataaacacagtcaacttgaca 
gttggccctcaggtcatggaagtgattgtctttcgaagacacggccgggcatacattccc 
atctccaaagtgaaagacgtgtatgtgataacagatcagatccctatattcgtgaccatg 
taccagaagaatgaccggaactcgtctgatgaaaccttcctcagagacctccccattttc 
ttcgatgtcctcattcacgatcccagtcatttcctcaactactctgccatttcctacaag 
tggaactttggggacaacactggcctgtttgtctccaacaatcacactttgaatcacacg 
tatgtgctcaatggaaccttcaactttaacctcaccgtgcaaactgcagtgccgggacca 
tgcccctcacccacaccttcgccttcttcttcgacttctccttcgcctgcatcttcgcct 
tcacccacattatcaacacctagtccctctttaatgcctactggctacaaatccatggag 
ctgagtgacatttccaatgaaaactgccgaataaacagatatggttacttcagagccacc 
atcacaattgtagatggaatcctagaagtcaacatcatccaggtagcagatgtcccaatc 
cccacactgcagcctgacaactcactgatggacttcattgtgacctgcaaaggggccact 
cccacggaagcctgtacgatcatctctgaccccacctgccagatcgcccagaacagggtg 
tgcagcccggtggctgtggatgagctgtgcctcctgtccgtgaggagagccttcaatggg 
tccggcacgtactgtgtgaatttcactctgggagacgatgcaagcctggccctcaccagc 
gccctgatctctatccctggcaaagacctaggctcccctctgagaacagtgaatggtgtc 
ctgatctccattggctgcctggccatgtttgtcaccatggttaccatcttgctgtacaaa 
aaacacaagacgtacaagccaataggaaactgcaccaggaacgtggtcaagggcaaaggc 
ctgagtgtttttctcagccatgcaaaagccccgttctcccgaggagaccgggagaaggat 
ccactgctccaggacaagccatggatgctctaagtcttcactctcacttctgactgggaa 
cccactcttctgtgcatgtatgtgagctgtgcagaagtacatgactggtagctgttgttt 
tctacggattattgtaaaatgtatatcatggtttagggagtgtagttaattggcatttta 
gtgaagggatgggaagacagtatttcttcgcatctgtattgtggtttttatactgttaat 
agggtgggcacattgtgtctgaagggggagggggaggtcactgctacttaaggtcctagg 
ttaactgggagaggatgccccaggctccttagatttctacacaagatgtgcctgaaccca 
gctagtcctgacctaaaggccatgcttcatcaactctatctcagctcattgaacatacct 
gagcgcctgatggaattataatggaaccaagcttgttgtatggtgtgtgtgtgtacataa 
gatactcattaaaaagacagtctattaaaaaaaaaaaaaa 

> scr_cg - 2 2 5 1 0 6 7 4_1 (TOXMARKER Assignment: 24; SEQ ID NO: 24) 

gaattcttgcagttacagagtatggctgttgtctactcgggagctcccagatcctcataa 

ctcagggacgtgtccctatttatggacaaaaaagtttgacgccaggtcgggcctacatga 

gctcttctctaccctgcaagtccccagtgtatctgaggaaggtgtattctgtcagagaag 

caaggaagatcaatgcacacctttagtctcagccccataggaggcagagtcaagcagatc 

t 

>scr_cg-57215224_l (TOXMARKER Assignment: 25; SEQ ID NO: 25) 

aagctttatagtcaggcacagctggctgttgccaggcaactgtggggcagagcatacctg 

gctgttgccaagtagctgtggggtggagcttagacagaatcccaacagatagtatagttg 

gagagggtttcagtctgtcacagtggggaggcaggggcagtagttgagttcatggtgacc 

agatcttgtgatggaggaaatttacatcatcatcccaggctagaaagcagtgagcagggc 

agagacaggagcaggttatcaccttggaagacctgacactagt 

>scr_gb-aa850767_2 (TOXMARKER Assignment: 26; SEQ ID NO: 26) 

ttgcggccgcccaagtctgccacttcaacactgtatctaaaacttgaaaggcactgtcaa 

aaaccctggtgggttcctagctttagggatccatcgttagagtcagtaaacatggcaact 

ctgcctccgggcatgtgatacgtcgccagcagaggcttgctagcccttgccacacaacgc 

tcagcttactcaaagcactgccaagacatggctgccctgagacggttgtctgggctcctt 

ccttcctataccttagggcgcccccttcacagcactgggtaagcaatcagcccctcccgg 

agaggagaagggaaggtaaaagacaaaggtatgttttacactatgcaaaacgttccagag 

ggggaagatgaacgaagtaacaagtatccaacacagggttttaaaaagcaacgacatttc 
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aaatgagcttgtatgggagaaagaaaagcaggttttcaggaaaaatccaaacacattcag 
gtgtgtcttttaagtcatgagtttatcatttattctaagttcattgggaggaaaactgga 
gactatcagcatagctgtcttactggggaaggcattcccagtgaataaacatctccctta 
cctgagctcttggcgagagattctgcccagcttgactctctc 

>scr_gb-ai011994_2 (TOXMARKER Assignment: 27; SEQIDNO: 27) 

tttttttttttttttttccagaaatttgcccattcttttatttgaaggcaaaaattccca 

tggaagtctggatgaagagagagacaaaggcttatagaaaataaattgaataactagaga 

ttctctggatccagacatagttggttgataaatttgttacctatttctcattgtatttca 

cattatttagacatagttcttgacatctctgttttgcatactgtctctggccaagagttt 

tggtcttcctttctaaatatcaagaggaaaaatggcagaacaaaccagtaatgttacatg 

gcatgtggttcctgagtatataatcaagcattagcagcagttgtagttatctgaatataa 

tgcatagatataatacatgaccgaagagacacaccgatttaaacaaccaatgtcaacact 

gaaacaaagaattttaatgctaaggcacccaatcacggtgtctttcagttatttgttgtt 

ttctttaggagactggccatacacagcagggattcaaaattgtggcttgcagtcatgaat 

caacatttgcatttgagtaacttacccatcttctttatgcttccacaaacatagtttcag 

ttgggataatcactgaggtgtgcacagccctttcttcctgtagtttaggcaatatccaag 

gctgtagaacttggggtaaggtgtaatggtgtcacaggaggagacatctactcactgtta 

aatgttgctctgatgtaggttggccatagctccccatacgatctcacagggaagccgatg 

ggtaatagcagcaggaagatcatggtctacataactgactctggaacttcttgacttata 

acttattactttttgggtttcttttc 

>scr_gb-awl42293_l (TOXMARKER Assignment: 28; SEQ ID NO: 28) 

agggaacccggtttctgaggttaagaacctggtatgaggtagaaagcagaatcggacctt 

aggcactcgagcgtcgtgtcgaagaaacattaaatagaatagaggagtaaaggggatgtt 

tcggataagcgctaggtcgagtcaaagaagtcttgcaagaagagttaagggagcaagaat 

ttctagaagcatctagataaggagtcgtagcatactgacgttactagtaataagtagggt 

gagtcggagaatcatgcgctcgatggtcataagatagtatctatcgaggagtgtaggagg 

cctcgtccttcggcggaaaagtaacgcgtagcggttaagaatcttgtcgttcattatctt 

aagggtaaggagccatcagtttagaagtcgttcccgcggtagtaagttcgcgtcgatttt 

aataagactttagattgcgtcgtttagtcgacgtagtagacggttaatagtaacggtctt 

acttccttaagcgtttcgctagttcttaagcttaattcggctactctagattttaccttt 

ggggttaagtttccgttagcgttgttggaatcggttttgcctgcggggtggacgcccgtc 

taggagaacgcattcgctacgaacggtgc 

> s cr_gb - bm3 8 3 3 2 7_1 (TOXMARKER Assignment: 29; SEQ ID NO: 29) 

tttttttttttttttttgatggccagtgacagtttttgctttttttatatttataaacaa 

aaccaacctcccccccaagtaactccccaaacaaacaaaaaaccagattaaataaaattt 

acagtgaacccagcaaacatctgtatgtgcaattaaatactgtgtctgttactgtggtgg 

cacgaacctcaaacaaacaatatacaagtgttctggggttggatcaggggtcgggggagt 

cccaagttttaactctgtggggtttggggagacaaggtggggaattgaacgaatggggaa 

atcaatttatttttcttaattctgtccatataaatatattcatgaagaccaaaagaggga 

agggcagttgggctggtgatgaagtgggagaaggggagggcagagccctctcaactctac 

tcagccaaaaatatgaaacaaattaatttcatggtgggagaagagatttaaaaaatgata 

gaagatgggaaggagggggagacagaaggggaccaaccagggaaaagggggacccatggc 

aagggagtcccatgtcaaggagtcctgtgccggtgtgagaatctgtctgcttctctcttc 

agccataatgtggtaagctctggcccaatccgccttcggctcccggcttggcccttgctc 

ctattgtgccagcccctcccgcctccagctattgagagctagctcgctccaggatcctca 

ggtcgtagttctttttagctactcgaagtttgaagcgactcacagagttgttgaggcgaa 

gggaggcattgtgggcagccaggggactggggaacacagccactatagtgtacaaggcag 

cgaggtccgcatggcggccattctcagcagtcccactgttgtcccccccacctgcaccag 

gcaacccctgagcatccttaagccactggatcttggcaccagacatggcaagctgtgtga 

agagtttgtctgcctctgtgcgggtgattccttccgggagatcagtcacctccagtaccc 

ttcccagcacaacatccgctgtccccaggtcagtggaggcagacttgagtgcttgtctct 

tgcctcggtttccatgcttcaatccactctgtccctggtgcaccgtatacgttgactggc 

catgg 

>scr_gb-bm386625_l (TOXMARKER Assignment: 30; SEQ ID NO: 30) 

tttttttttttttttttcacatgtcaacaactgctagctactattaaaatactgtcaccc 

aaggaggtggaatgtttaacagaaaataggctttaacaattcatactggtcctcaataac 
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tgcagatgactagttcaagccaactgcaaaactgagcaagaaatgcagcttgaagaacag 
gacaataaaatttaatcttgcaacttgatagacttggaggcattccggtcaatgtagaag 
accttgcgggcctcagagttaaagcccaggccagcccctaggctgtacttccagctcatg 
gcccggtcgtagtcctgctgcagactctgctggagcgtatctgaagacttcttgtccagg 
gccatgttggacctgacagtcatgctgggaggacggttgaatgacggggatagatgctta 
aagccgcccataagtttcaggaaattttagtttctgtttcttcattttcaaagcccgcag 
tgtcccactggccaaactgggttccctgatctatgcaggcctcatccatattgccttttt 
tttccagtaccacctccaagtctgtgtctgactcctctttttcctcctgccaggggtctt 
cctttactccgctctcttttctcctcttctttttcttccttttcacagccagcccttcac 
caactggctgctccaccttttttttggatttcactttcttcttcccaggggccttcaggc 
tgtctatcgggatgaaatccaacgcttcacttctgactgacttcttattccctttcttca 
agctgttctccatggagatcttggagttgaccgggaggatgtctccctcctgatgagtct 
ttttcttcttcttctttttcatgctgtgatctctggggctcccctgcttccttttgtgcc 
ccaaggccgcctgctcttcggcctctgccccctctaagcatgagtgcaaagcatccccag 
cctcagggatccaagagtcctggggaggaaaagcttccatgtccggaagcttcttctcct 
tcctgtgctttttaggcttcctaccagctttgcttacctccttggcatgcttggagtctg 
gggaagtcttcagccctgagccttgggaggcaacctttgataaggacttccgcctcttct 
ttttcttctctctgatgaggccttctgctgactgctcaaggacctgccttctaaggctag 
gtgactttatctgtcttgtccatgtaggctcatccttgcccaggtactcatccaagtgtg 
tgctacaggacttttttctcttctttctcttgcccagtgacatctcagggacctgcacct 
cacccacattgttaaaaggggatgtagcccttggaggagaaacatctatgaagtaatcat 
tattgtttaagactgagtactgagtctctggttctgagacatttgccaccttctttttct 
tcttcttctttttcttcttctctgggagccgtgggcccaggtcttctttctgagtcttgt 
tgaccattactggtctattagcaggccaagcatccccacgtgagcacccgcgcagccgcg 
acccggaagtcagcttcgaatttctggcccgccccctcgaaatcgttctccttccgggtc 
gcagcttcgcggcgccctgggttgctgtagaaacggcgtccatggccgtgcctagacaag 
catccagcctcagcgtgctgcgtgaggagacgggaggcgctgcggactcgccggtcacta 
cacgaatgcccgggctcgcagggtcgcctgggtcccccgaagttctcgtgttacccgcgc 
aggtcgccgagcctccggggaagaacctgtgggagcagatctgcgaggagtatgaagccg 
agcagcctacctttccggaaggatataaagtgaagtttagtttcctgccttgcccggaat 
gctacgctttcacgtggccatcttccccgcagttgttgacatgcctagtgaccgtgacct 
ctgacacccgttttcccacttttgccaggatctgtattttaacttacttcagagtcctct 
tagttgtcttggtttgggggtggtttgggggtgttgggataacagatggggcaaggctgt 
agccctactgagctgtttccagaggccgttgtcaggaaggatttccagtgttacagcccc 
agagtataacagcagcgccctgttagcttaatggtccccattggttctgtggctgcggct 
caccaggattctcccattcaaaaggcccagacatggctgacagcctcctctgtaggtctg 
actgacaagctaccacgcgtcttaggtaaatagtaaagcctttattttcttgttaagaac 
agcattttgaaaataaaacctatctgcccatgcttaacaacctttaaagtctgtgatatt 
ttatatacagccctgtacatactgattgtctggaaatttcttaaacagtttttgtttata 
agtatgcaagtcagccaggatgaggggaagagtgagggtacattataaaatacacattaa 
tacatttaataaatatatattatctatcaaaaacgagccatagctcttaatgaataaagc 
acctgccaagggctctcatcagctcacagttgctacatccttggatgtgtaaatgccagt 
gcccccttctactttgccatttggcaaattcaaaagacaactcttccaccaccctgcact 
tgttccctggccttgacctcctctgtgtgggggtggggcagacaacaaccagatcttaac 
tttagaaacagctgacacattggagcccctcccctctgccattgtcctgctaccttggca 
actgactccagacctctatggagtcttcactcaggaggggacagagcggtggttatagtc 
ccaatatggtattagtacccgggcatgccaagttgtgcttgcagtttggggttattcaca 
gatgactttctagaccattttccccaaccaagtgttgggtgtatcaacacttaaacaggt 
gccatgggattatgcatttcagccttgctctgtcagaagctggctgccacagtatctggg 
tggagttgcctcgtggtcctcctcgtg 

> sc r_s c - 1 3 3 5 5 6 9 6 9_1 (TOXMARKER Assignment: 3 1 ; SEQ ID NO: 31) 

tgtacaatgggggataaaagtgtcaaatgagatgttgctatagtttcatttcttttgccg 

tgatagagcaccctgacaaaaagcagcacgagaggaaatgtatctggcttacgattccat 

gttaaagcccgtcattgatgaggtgggtcggggagtcaaggtaagactgtaaacagctag 

tcaatcacatccacagtcagagacagaaggacacaaattcatggatacttgctcctttgc 

actcagctcagtttctccactcttacacagttttaaatgccctgcctagggagtgatgcc 

acccacagtgggctggatgttcccacatcagttatgacaatctcccacctcatgcccata 

ggccaacccaatgtagacaatctctcattgagactctcttcccaggccatgtcaagctga 

cagttatagctagc 
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>scr_sc-170142736_l (TOXMARKER Assignment: 32; SEQ ID NO: 32) 

agatctcttaagtgaaaatagaaaaatgattactaacgagaagatagacgcctacaacga 

agctgcagtcagcattctgaacagcagcaccaggacatccaagtccaatgtcaagatgtt 

cagtgtttccaaactcatcgcccaagaaaccatcatggagtctttgggtggcttacacct 

tcctgaatcaagcagagaaactagtgcaatgattctcatga 

>scr_j3C-2563586_2 (TOXMARKER Assignment: 33; SEQ ID NO: 33) 

tcatgactcccagcattgacattcccctacaatagggctttgagccttcacaaaaccaag 

ggcctctcctgccattgttgctcaacaaggccatcctctgcttgatatgcctctcgagtc 

atgggtcattccatgtgtaatctttggtggtttagtacctggcagctctgcatggttgat 

attgttgttcttactatggagtgacaagcctgttctgcttgttcaattatttgtctaact 

ccttagttgagtaccctgtttgcagtccaatggttgggtgtcagaatctgcctctgtatt 

tgtcaggctctggcagaggctctcaggagacagctatatctggctcctttcagccagcac 

ttcttggcattagcaataatgtctaggtttgatgactataaatgggatggatccctaggt 

gtgatagtttctggatggcctttccttcagtcactgctccacattaggtcttgatatttc 

ctccttattttgtttccctttctgccccatcgttgtgcccttttgatagattttgcagtt 

tagaaatacaatttacgtgcaggtttattgcattcagatct 

>scr_sc- 87618257JL (TOXMARKER Assignment: 34; SEQ ID NO: 34) 

tcatgatgaagaaatgggttctcggcaataggcaaaggcaggatgagagcagaggggtcc 

atgggggtcgaaggctgcccatgggggtggttctatgctctgaccattttgagatgaact 

aataatgttccggcagtggctatcccctaacaaagatcacaagccgcctagtggagggaa 

tggaatctgaactctggtaccagcctccaagatccagatct 

>cszr 96561134^83 7604 93 (TOXMARKER Assignment: 35; SEQ ID NO: 35) 

gaattcactagaccagcatattgctctatgctgcctttccagcgctgtactgcctgtagt 
ggaacagactcttggagtccacagtacgagctttctgcacagcctcagcaaaaagtttgg 
tcacctggaaattggtgagcagagcaattccactgtccacagctgtcctccgaatcacat 
aattatcatggacaaatttggtgttgttattggggaggttaatcactaggtcaatgcttc 
cgtctcttatcaactttctgatggaagagaggctgggattctgtccttcctgagatggcc 
aagccactggggtggcaggaacattgttggcgttgagccagtctgatgtggcttctgtgg 
caaaaagctt 

>scr_gb-x83855_l (TOXMARKER Assignment: 36; SEQ ID NO: 36) 

ctagtccccgcagcctagcgcgggcggcggcgggcgatggaggagagcagagccccgggc 

cccgccgtcctccagcgcgctccgctgcaaccccgcagctgagcccagaggctccggccc 

tgtgcgccctaccgcggccccgccactatggccggcgtgtgggcgccggagcactcggtt 

gaagcgcacagcaaccagtcaagtgctgccgacggctgcggctctgtgtccgtggccttc 

cccatcaccatgatggtcactggcttcgtgggcaacgcgctggccatgttgcttgtgtcg 

cgcagctatagacgccgggagagcaaacgcaaaaagtctttcctgctgtgcattggctgg 

ctggcgctcaccgacttggtggggcagctcctgaccagtccggtggtcatcctcgtgtac 

ctgtcgcagcgacgctgggagcaactcgacccatcggggcgcctgtgcaccttcttcggg 

ctgaccatgacagtgttcggactgtcctcgctcttggtggccagcgccatggccgtggag 

cgcgccctggctatccgtgcgccgcactggtatgccagccacatgaagactcgcgccacg 

cgcgcggtactgctgggtgtgtggctgtctgtgctcgccttcgcgctgctgcctgtgctg 

ggcgtgggccgctacagcgtgcagtggcccggcacgtggtgcttcatcagcaccgggccg 

gcgggcaacgagacggactctgcgcgggagccgggcagcgtggcctttgcctccgccttc 

gcctgtctaggcttgctggctctggtggtgacctttgcctgcaacctggcgaccatcaaa 

gccctggtgtcccgctgccgggccaaagccgccgcctcgcagtccagcgcccagtggggc 

cggatcaccacggagacggctatccagcttatggggatcatgtgtgtactgtccgtctgc 

tggtcgccgctattgataatgatgctgaaaatgatcttcaatcagatgtcagtagagcaa 

tgcaagacgcagatgggaaaggagaaggagtgcaattccttcctaatcgccgttcgcctg 

gcttcgctgaaccagatcttggatccctgggtttatctgctgctaagaaagatccttctt 

cgaaagttctgccagatcagggaccacaccaactatgcttccagctctacctccttgccc 

tgcccaggcttctcagtcctgatgtggagtgaccagctagaaagatgatgaacaacctga 

agcggagtttcattgcaatacctgcttccctgagtatgagaatttcttcccccagggaag 

gataactgaatcattttggattgtatcttctttcggcctcatattttaagttttccttgc 

cattaaacacaccgagacaagctt 

>cszr_2 2980046 5_1 90907286 (TOXMARKER Assignment: 37; SEQ ID NO: 37) 
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agatctctacaccgcaaaaggtctcttccgtgctgcggtgcccagcggtgcgtccactgg 
catctacgaggccctagaactccgagacaatgataagacccgcttcatggggaagggtgt 
ctcaaaggctgttgagcacatcaataaaactattgcacctgctctggttagcaagaaact 
gaatgttgtggagcaggagaagattgaccagctgatgatcgagatggacggcacagagaa 
5 taaatctaagtttggcgcacatgccatcctgggagtgtccctggctgtctgcaaggctgg 
tgccgtggagaagggggtgcccctttaccgtcacattgccgacttggccggcaaccctga 
agtcatcctgccggtcccagctttcaatgtgatcaacggcggttctcatgctggcgacaa 
gttggccatgcaagagttcatga 

10 >scr_gb-bi277612_l (TOXMARKER Assignment: 38; SEQ ID NO: 38) 

gggcccctcctgctcgctgctgctggaggcgtttcggcgatattacaactatatttttgg 
tttctacaagagacatcatggccctgctaaatttcaagataaaccacagttagagaagct 
tctggtcttcattaacctcgaaccgcagtgtgatgccttccctagtatgtcatcagatga 
gtcctattctctacttgtacaagaaccagtagctctcctcaaggccaacgaagtttgggg 

15 agcactaagaggtttggagacctttagccagttggtttaccaggacgcttatgggacttt 
taccatcaatgaatccactattgctgattctccaagattccctcatagaggaattctaat 
tgatacatccagacactacctgcctgtgaagacaatttttaaaactctggatgtcatggc 
ttttaataagtttaacgtccttcactggcacatagtggacgaccagtctttcccttatca 
gagtatcacttttcctgagctaagcaacaagggaagctattctttgtctcatgtctatac 

20 accaaacgacatccatatggtacttgaatatgcccggctccgagggattcgagtcatacc 
agaattcgatagccccggccatacacagtcttgggggaaaggtcagaaaaaccttctaac 
tccatgtttcattcaaaaaattagaactcaaaaggttggacctgtagacccaagtctaaa 
tacaacatacgtattctttgacacattcttcaaagaaatcagcagggtgtttccagacca 
gtttatccacttgggaggagatgaagtggaatttgaatgttgggcatcaaatccaaacat 

25 ccaaaatttcatgaagaaaaagggctttggcaacaattttagaagactagaatcctttta 
tatcaaaaagtaagtcatctgaaagcctaatcaccactgttttcatacaagtccaagctg 
cgacttagctctctgctttacttctcatcttccccactgcttgcaagagtggagccaaga 
acacctaggaggcagtaagcattttgcagtaactactgaaatagagggagaagccatgcg 
cccgctaggagctctggctgccctttgtcttttgcactatccaggggctggaactcactc 

30 cctttgtcctgagtgacctggggcatctctgctccttacacagtgcagtgacatttccaa 
cattccacagccagggaattggtactgaagtggtggctgccttgttagaaaacacagaca 
gaccacttcccaaaagtttggtggacagtctgttctctaagaatcagcacatttttcccc 
atagggaccagaccacacttaggcatcatgggccatgtggagttgcaaatctcttttana 
a 

35 

>scr_gb- j 05266_3 (TOXMARKER Assignment: 39; SEQ ID NO: 39) 
ttttttttttttccagagcagaggtcttttttaatcaatcacaaagtactttaaaatctc 
ataggggacagccttgaatcatctatGcacgctgattgtaccggtaagtagaacaggata 
agagcaattcgccagctgcagcacagtctggtacacgagcagcccggggccagccatgcc 

40 tggcgttacaatgtgctctcacaaaagtaactcatggaactcaacgtgaagtcgcgcttt 
tttttttttggttcttttttttccggagctggggaccgaacccagggccttgcgcttcct 
aggcaagcgctctaccactgagctaaatccccaacccctgaagtcgagctttaaataata 
acctgagttaaattcccagggaaaggagggcactgactcctacaggctgctctctgacct 
ccacaagtcccaggatacatctgagcccgtcccacacaaactagcactcaatatggaact 

45 tttattcatgtgatttctgtacatcagggagtacaagagtaaacctttacaaatggtgct 
gattttaccacaataaatgacaaaaccaaagcagtgtctggtgacagtggcagggcttta 
aggttcaaacccagccaagaagtttgttacgatttccttcagctttgcatccgactgttc 
tgagattttcccatcagacctgatattgcccaagaggctctggtgctggctcacaacatg 
agacaagaaagcactctcgaactttgtgatcttactgggctccagtttatcaagataacc 

50 ccggacgcctgcatagatgacagccacctgttcttcaatagccatgggagagtactgtcc 
ttgctttagcagctcggtcaggcgcacgccacggctcaagagctgctgagtggcagcatc 
cagatcagaaccaaactgggcaaaagcagcgacctcccggtactgggccaactccagctt 
catggtgcctgccacctgcttcatggctctggtctgggcggcagatccgacacgggacac 
agacaagcccacattaatggcagggcggatgcctttatagaacaattctgtttccaagaa 

55 gatctgtccatcggtgatggaaataacgtttgttggaatgtaggcggacacatcaccagc 
ctgtgtttcaatgactggtaaggcagtcaaagagccaccaccaaaggaatcgttcatctt 
ggctgctctctccagcaggcgagagtgtaggtaaaacacatcaccgggataggcctctcg 
acccgggggtcggcggagcagcagagacatctggcggtaagcaacagcctgcttggataa 
gtcgtcatagatgatcagagcgtgcttgccattatctcggaaatactctcccatggagca 

60 gccggagtaaggagccaagtactgaagcggggcagcatcagaggcagtggctgacaccac 
aatggtgtacttcatggcatctgcgtctgtcagtctcttcaccaactgagcaacggtgga 
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ccgtttctgaccaatagcaacgtagatgcagtacagtttcttcttctcgtcagtcccatc 
attgaaacgcttctggttgatgattgtgtcaatagcaatcgaggttttcccggtctgtct 
gtctccaataatcagctcacgctgacctcggccaatcggcaccaggctatccacagcctt 
gatgcccgtctgcattggttcccgcacagagattcgggggataattccaggggctttcag 
5 gcccactcgtctgcgaatcttggaaccaactggacccttcccatcaatggcatttcccag 
ggcatcaactacacggcccaacagttcatcgccaactggaacgtccacgatggctcctgt 
tctcttcacgatatcaccttctttaattagcttgtcattcccaaacacgacaactccaac 
attgtcgggttccaagttcagggacataccctttaagccggaagaaaactctaccatctc 
ctcagcttgaacgttcctcagtccatgcactcgggcaataccatcaccaatgcttaagac 
10 acggccagtctcttcaaggtcaacagaagtatcagctccaaggatccgctcctcgagaat 
ggaggacatctcggcagtgccagtcttctgaagtcgagtgttagaggcatggagatttct 
tgtaccaacaaaagatgaccccaaggcatttttggagaccagtcccgcccgtcgagggag 
ggcacggcg 

15 >scr_gb-m37394_5 (TOXMARKER Assignment: 40; SEQ ID NO: 40) 

ttttttttttttttttttgcttgtttgtttgtttgtttacttcatgaaatgaaaacagga 
aagcatattaaaactcaaaacaatgaaacagaaaacataaaaggtagtctaatagtcaga 
aaacactggtaaactagcgtgtgttaagtatcagggacatatttatacaaaaaagtaagt 
ctgagggaaaattctacccagtcattcttctcccagtcccagtaagtaacaaagtggctt 

20 atcctattgtacctgccatggtttaatgctgtacaagtgtggcctgctgagcacatccag 
gacttcttgtgcatgtagttatcttgccatggaagtgtcttgatgcagagctgctagaac 
caactgtctggtcagttggctccaggcaactctgtgtaatacacgctacgggcaagcttc 
ttcctttatggaagagtgcatgaatcaaatcaataaagacaagaatcccagagttcccta 
tgtcagcaagcgccataggtctgttttttttccccctatgtacctcaccatgaggcaacc 

25 ttctgttccaaaaggacaatgttctcgatggatacctttcagtggaatcttcacagttcg 
aagaccaatagatataccttcaacttcccaaagagcatcaggggaggggcccacttcttg 
gctcagtgacaaagcccgtcagagttatgctttaaagccagtctgagggtttgacatttg 
acacaatgtggacatggctgtcaggagcagaggtgctgccatggcttggtcctgggcctc 
tggaaagtccggtttgtaactggtacaatgcctcttcaatgtcatgctccactaaactca 

30 ctgcttggcggtgccacccgcaggtactctgcattttcagctgtggggcccttaaagatg 
ccattcggcttggcttctttgggaaagaagtcctgctggtagtcagggttgtccaggctc 
atttggtggctgcctttctggatccagagggcagagctgtcaaacccactactgaggcag 
gtcggctgggcagtgttgagatactcagggttgctcaccgcattgctatggggattttga 
taatgcaggtctcttccaggagctggatgcaggggctgattgtgatagactgggttctgc 

35 acagagccagccggcctcttgggaacagattggtttatatattcaggcacgggaaggaat 
gtgtcatctatgttgtcctctgtcaggacgctggtgggatcggagctataccgttgcaag 
aaggcgtcttctttgacacggcagctcccatttctattaatgcaagccacagtggaactg 
ttgctatttgcactcagagagctcaagagtggagtccgtgacgtggatgggctgttgaag 
aagccttgctgtgggatgaggtattcatcagcatcaactacgtcttccatgtcctcctcc 

40 tccatcagggctcggtaaaagttggagtctgtagggctcggcaaatgcatcctttcatcc 
ccctggataacaaggtagcgctgtgggtctctggccattttggagaattcgagaatcaac* 
tctcggaactttgggtggctatcagcatctatcatccagcacttgaccatgatcatgtag 
acgtcgatggtgcagataggtggctgtggaaggcgctctcctttctctaggatggatgag 
atctcacttgcagggatcccatcataaggcttggacccaaaggtcatcagttcccacacg 

45 gtgactccatagctccagacgtcgctttggtgtgtataaattcggtgtaaaattgattcc 
aaagccatccacttgataggcactttgcccccctctgcatggtattctttctcctcagca 
ccaagcagtttggccagtccaaaatctgtgatcttgacatgctgtggtgtctttaccagt 
acattcctggctgccaagtcacggtgtaccaaacgccggtcttccaggtagttcatgccc 
tttgcaatctgcacacaccagttgagtaggtactgggagccaatgttgtccttatgttct 

50 cggacatagtccaggaggcaaccatagggcatgagttgtgtaatgagctggacagtggag 
gtcagacagatgcccaggaggcggcatacatgagggttgtccacactggccatcacgtag 
gcttcatcaaggatttccttgttggctttgggagatgtggcttctcttaactccttgatg 
gccacagggattttcactttctcgccttctgggatccagagacccttatacactgtgcca 
aatgctcctgaacccagaactttgatctttttgaattctgtttcctttaatatcctcaag 

55 tgggcttggttcggagcttctccgctgggtgtgagaggttccacgagctctctctcttga 
agcaggcggcgtagtgtacgttttcggacaagctgacgtcgacgcatgaagaggccgatc 
ccaagggccaccactactatgaagaggaggccacccacaatcccagtggcgatggatggg 
atctttggcccttctggttgttgacatcctttaaggcctggcccagcacatccataggta 
cagtttgcatggcagaggtggcagacgttattggcatctgcaaacttccagaccagggtg 

60 ttgttctcccccatgatgcccgaagggcaggtcttgacacagtggggaccatcaacatag 
tgggcacacttgatgcagttgtctggcccccggcctgtacaggtgatgttcatggtctgg 
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ggcagacattctggatggcactggatgcattcagaattttccacaaactccctcggttcc 
ccctccaggatgttgcacttgtccacgcactccctgcctctgctcacattctggcaggag 
acacagtccgtgggctcagggccccagcagccttccgaggagcataaaggattacagacg 
tggttcgtggccttgcagtccttttcagctctgttgttcatgattttggtcttttgattg 
5 ggcgtcccgaagagttttttccagtttatagtgtttgcgtagcacaaatttcggttccca 
gaaataatcacatccccatcactgatctccttgagggaacgcaaccccagcgatgttatg 
ttcaggccgacaaccgccagagaaaactgaccatgttgctttgttctgccacgaattatt 
tctaggttctcaaaagcatggaggtcagtccagttttcaggccaagcctgaatcagcaaa 
aaccctgttatttccttcacagttttgagaatttctagttcccgtgggtctagaggagga 

10 gtgcgggtgaaagaatcccccttaaaggccactggcaggatgtggaggtccccactgatg 
gcagtgcagtacttgaagtgtttgatgtttgtagcatttatggagagtgtgtctttaaat 
tcaccaatgcctatgccattgcaaactttgcggcagggcccgtcacattttttacacttg 
ctgactccatcttcttctacttcatagtagtctggcccacaggcccggacacacgagccg 
tgatctgtcaccacgtagtttctggggcatttcttcacacaggtggcaccaaagctgtac 

15 ttcccctcagggttgacatccatctggtacgtggtggggttgtacagcatgagtggtggg 
caggtgtctttgcacgtggcttcatctcggaacctgtggcagaccagacagtcactctct 
ctgggccctgtacaccctgcggcacactggttgtggcagcagtcgctaggggacctgcca 
cgacaacgccgggaacattgctgggcgcagatgattttggtcaatttctggcagttctcc 
tctcctcttccccagcagcttccattgggacagctcggatcacatttcgggcagcccgtc 

20 aggtggcgctgtacgtccattgacatgttgctcagaaagacatcttggacgatgtccctc 
cactggatggtctccatattgcagaggatggggttgttgctaaatcgcacagcaccgatc 
agaatttcctgtaagttccgcatgggcagttccctaagcccagttttgttggttccatag 
ttggacaggacggctaaggcgtaggtgttttcgtagagagcatttcccctgatgatctgc 
aggttctccaaagggattctctccacggtgttcagggcaatgagaacatagccagccacc 

25 tcctggatggtctttaagaaggaaaggtcataattcctttgcacataggtgatttccaag 
tttccaaggaccacttcacagttgttgaacatcctctggaggctcagaaagtggtcttca 
aaggtgcctagttgggtgagcctgttacttgtgccttggcaaactttcttttcctccagc 
gccccacctgcggcgcagagcgcagccagcagcagcagtagcttggttctcgcagtccct 
gagggtcgcatcccggctcggcagtcgttggctctggctctccgggattaatccgagtca 

30 gactgagtcccacggtcgtgcccggtgactgcgtcggcaacgacgacgggacccggactc 
agactcgcgtccaggtgacccgtcgcctgtcttggtggcggtagcctccgggactggctc 
cagacgctcgagcccaggaagagcgcacagt 

>scr_gb-m64300_4 (TOXMARKER Assignment: 41; SEQIDNO: 41) 

35 tccgatctgagcagacagctacagccaacagatggcgtgtaagtttggagctgtcactga 
cttaaggtgccttatgtcttagccttccctaatgtaaggtgggtgggcataactggaaca 
agtctgttaagacttgctctgaggaggctgacagttcagtaggtgacatgtaggaaggat 
tcagggcagggaggaaccactgcatctttcatccgacacagtagttactgactaaacaac 
agtgagcacttgagtgcactgagtgcaactgtgcagggcctggtgcaggagaactctctg 

40 gactgaagaattccgtgaaagtataaaagccactacgaccagaactgcccctcggaacgg 
ctcaaaggagtcaagagtgggtaagctgagacgggctggagacaggaccagggtcaagaa 
ctggggggacaccgacatctgaacgcgtccagtcctctgagcccttgtcctgaccaattt 
aagatctgtatcctggctggaatcgagcagtctcttcaaaaatgagttctttgagcttct 
ccttaggtaagtcgtccagctccatgtcaaacttgaatggtgcttcagcaatgggctcat 

45 cacttgggtcataatactgctccaggtacgggtgggccagagcctgttcaacttcaatcc 
tcttgtgagggttaaatgtcaacattttatccagtaaatccagagctttggagtcagcgt 
ttgggaacaacctgttccacggcaccttatttttgtgcgggagagaaagcaaatagtttc 
tagcttttaaatttattatacaattcagatcttcctgtgatggagatccaagaataccca 
ggatgtgattcagctggtcaaggtaatgctttcctgggaagataggcctgttggatagca 

50 tctctgccaggatgcagcccacagaccaaatatcaatggacttggtataacccttggaat 
tcaacataatttctggagctctgtaccaacgcgtggctacatactctgtcaagaaccctg 
tatgatcatggtctggatctgcaacacgggcaaggccaaagtcacagatcttgagatcac 
aagtggtgttcagcaggaggttggaaggcttgaggtcacggtgcagaacattagctgaat 
gtatatactttaatcctctcaggatctgataaagaaaatagcagatatgatacttgctga 

55 ggtgctgtgtcttcaagagcttgtaaagatctgtctccatgaggtcctgtactatatata 
catctttcatctgctcaatggttggtgcccggatgatgtcattgatgccgatgatgttct 
catgtctgaagcgcagtaggatttttatctctctcagggttctctgacagtaggtctggt 
gctcaaaaggactgattttcttgatagcaactcgaactttgttgagattatcataagcag 
aacaaaccatgccgtaggcgccttctccgatgtacgagagattagtgtagcgcggcccca 

60 cgtcgaacacctgcccgcggaccatctccgggcccgccgccgccgccgccgccatgttgg 
ctgcacagcctccgccgcgttgggctcgacgcttcgcgttaccgctcgacttgtgctgcg 
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cttcccacaggaaccgcgccgccgcccgtgtagccggctggcggcgatcgggaacgagga 
gggaggacaacacagaagagagaactaaccgccggtagaaccacgg 



>scr_gb-bi294409_l (TOXMARKER Assignment: 42; SEQ ID NO: 42) 
5 gtgcacagaggggactcaacggtgtgccgctgctcagactacatctggcccacaaatgtt 
cttctagagccaccagaatttaagattattggctttaaggaccacataaatgtgatgatg 
gagtttccacctgccacttacaagctattcggggaaagcttatggaaaagactggagtct 
acatccttcgtcatcgaggaacagacagaggacagcattagggtgcacaagccccaaatg 
aataatgtcactgggaacttcacgtatgtccttagagacttacttccaaagacaaactac 
10 tgtgtgtctgtttattttgatgatacacctgtaataaaatctcccttaaaatgcaccgtc 
cttcagcctgaccaggaatcaggtatggctaggcttttaaaatttgcactgttgttttga 
tggaaaacttgctgaaagaaaaaaaaaaactcaagttctggtacactaaatgtacttctt 
ccaataaatgcacatcactgagctgtttaaaaaaaaaaaaaaaaaa 

15 >scr_gb-ab015747_3 (TOXMARKER Assignment: 43; SEQ ID NO: 43) 

tttttttttttttttttgacaagataaagagtctttattgacatagagctccacgtgacc 
tcttctgtcctgccctccttgcaaacatactaggtgtcccaaaggtagggacacgagcag 
acagtcctgagcctggtcccgtcctccagaatgcagtcagactgcagtctgccatctgcc 
atccctatcatctggccaccaaccagaaccagccccacagttcccttgtggtctcgcctt 

20 ggctgccagtggtggtgtccactgggacctgccactaggctgctgtgtttgtttactggg 
atcccacttccacatcctgggagccctggcttctggccacatgtgggtaactggcagtga 
ctttgggcaatcaagtttgcgttcttgttgctttccacaactgggccaagctgggacagc 
aggctctgcttctagtctcagtccgagctgttcaatgaatagcctccttggggcagtatc 
taccctcccttaactcaaaatttccactagttagggcctcccaagccactgccaggccag 

25 ctgcgagtttctaggaccagcttccagctggagaacccgacagctatgccaggactgctg 
tgagccttgggcaaacggtctattgggtggacagaatgggcctgagcaggtagggcaaca 
agagctaggagagcccagggcttaagaatatcagcactgctgtgggagaaagcaaaatga 
gtccctgaatcccttgtgagggaggagagcccaggccaacggtaggggagacagccaggc 
tctgaacttctagggtcaggccaagttcacatcttcacttcaccattctttcgatttctg 

30 ggaaacctgccagctgggctgtctctcaggaagcacttccctggcttggaggaaccccgg 
ccttagcacagacctcagcaacaacagcacactcacctaagacacagtgacgcccagagt 
gcccacaggtacctcagtagtctggctgggaacaggagagtggccagggcccttgcccac 
ccctgacaaattggagggtgtcctgggtgctaaggtgaggttggcttcctgtgacatttc 
cccaggacagctctccaaggtccccgagagattccccaaggatggtgatttttcatcata 

35 gcaacagccgcagccagggctagcaacgacatggatctgaccatcttcctcctggctgtg 
gtgttgcttcaggtggccacacagatggcaggtgagggacgagtacacaatgccaaggcc 
caggtcatccccaaagggcttgggcacctgctctgaaggaggaggggccttcagccaggt 
gcctcctttgagccccagctctagaccaaggcattctggggtgctattgggtggggctga 
gttcaggggactgggtggcagctccatgtccagtccgaaggtgaataagggcatggagtt 

40 gggggactggttaggaacagggttctggaagggcttgtaccctccacatccactgtcagt 
ccctgctgctgctgtgtccgtgcagacgccactgctgctgagcaggctcgagaaagcctt 
gtaaccagtgtctccagaaggcccgacaccaggcacccacctggcctgggacgcaccctg 
cttcactgcctgcacaaactcttggtagccactggtaggggctggggtggagccagctgt 
cccgtgctgcaggacactcatgtgaaggatctgctcccagctctccgcttgctgcattgg 

45 tggccctgaagaatgggggtcaaccgggctcagaagatccccttcttccagatgtccagc 
ctgcttctgttctgaagccagctctccaggatttggggcggggctggagaagtcactaaa 
actccggtaggcaggattgtctgaaatgacaagggggacctgtgtgcaggctgtgccagt 
tgctctctcagggtctgggtgtggaggctgctgccctgtgacctggcatgtggtctcact 
gggccccgtggggaagcaggcccaggacgtagaagcttgcccactttctgaaggcagaag 

50 ggaggatgactctgccatgctcgactggccaacgcctccattctcagcccccagcaagtc 
tgaaaacaggttctcagtgagccgggccatgatgtctgcctgactctcctggaagccccc 
■tccgctgttctcaggtgacatgctcaggtcccctttgaccatctcatcctcttcctcctc 
cacattctgtactggggcctcaaacagctccatacagcgcaccacactgacatgaacgtt 
ctctggccagaggacggtcctgctgacctccgcaggataccagcctgctttttcaggact 

55 ctggagaggcttggttttggcagccttcggggattctctctctttcttcactctatgctc 
cagcaagcagggcagcagcttggttagacaagtcttccagtgccggctcttggttgactc 
ctggcttcgggtctgcttctcccagagggacaccttcgtgtcctgaatgatgatggctgc 
taagggactgcgtgctggagtgggaatctggtcccaccatatcttcttaatcttgataat 
gctgaagtaacaggtcaggcaaaacaataggatgcagatgcaggagatgctgacacccag 

60 cgggaggcgctgcagcaggggcagctggaagtggttgtaccacgtgatgctgggactcca 
ctcactccagatgccagggaagctctgggacaagaccctcacacgtgccctatagcgcac 
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ccctgatgttagggtgttgactgggaagctcagcttgggttccgtgtaggtcacattata 
gactttgaattccgccgggttgtcctctctggagatgttgaccatgcagatgaggccttt 
gtgcaggaagttgttcgatgggtatgggttgctccacatcagcagcaggccattggagac 
attggtgtggagtgtgaggttgtctggagctgggggcttcacattgtcactaggcttgaa 
5 ggagccttgccacagctgtccccgctcagaccacagttccagccagtatgtgtctgcctg 
gatcggctcctctatggccatttggcacacacacacggtgtcggcactgttcttgggggt 
gcatgtgaggttttcagagaactcgaagagcagcctgtagtccaggaggagctgagaact 
gcagtccacagtgctatccagctgccactcacacgtagaagtgcggatgtagtcagagaa 
gcaggtggggtcacccaggaccttgatgcccccagagccagtcacccatagcaaaatcag 
10 acagctcacggaggacaggaacttggtgcaaagccgccccattgcggacacaaaggtgcc 
tgggctatacagggagagactggaatgcagctcagtggcagcgtacctggcccccagatc 
ctgggctccctctccagcacctgtgtgttcaggctccacgcgccgtgcggggctttcctg 
cgcgaaggacctcgcccggtttcctacgccgcccggacgc 

15 >scr_sc-191879433_l (TOXMARKER Assignment: 44; SEQ ID NO: 44) 

gtgcactaagaatgacaaacttgctgtgtgccacaaagatcttgggtggctggttggtgg 
ccagtggtcaggttggcctcacactgctccaagtagaagagcagcagctgtcggtctgaa 
ggccccagtccccctgtccgccccggcacaaggggctgggctggtgtccagttggccagg 
tcatggtctatgggacgagacacctcctgctccagtcgctcaaactgtttcagctgctgc 

20 agctccagttggccttttccctgtcgcacgatgttgcccttttccagcagttccttctgg 
gtcttctcaaattcctccttcccctgcagatgaacgtagtcatagtcctccatccaaccc 
ccttcactgttctcatactggccatccgga 

>scr_sc-140438096_l (TOXMARKER Assignment: 45; SEQ ID NO: 45) 
25 tctagactttaacaacaagcgtgatgaacacccagagaaatgcaggagtcggactaagaa 
catgatgtggtacggtgtccttgggaccaaagaactgcttcacagaacctacaggaacct 
ggaacaaaaggtcctgctggagtgtgatgggcgcccgattcccctcccaagtcttcaggg 
aattgctgtcctcaacattcccagctatgctggagggaccaacttctgggggggcaccaa 
ggaagatgatacttttgcagctccatcattcgatgataagattctggaggtggtcgctgt 
30 gttcggcagcatgcagatggctgtgtctcgtgtaattaagctacaacatcatcgaattgc 
ccagtgtcgcacagtgaagatct 

>scr_gb-x87157_5 (TOXMARKER Assignment: 46; SEQ ID NO: 46) 
ttttttttttttttttttaaccaagaggaggaatataattgtgataggaaactaagaatc 

35 atgaagctcactacaaaagacaaacactactgaaacatgttgtgctggccttgacacacg 
caggcagactgtcgcctagctctgaggcagagggtcaaggttgacacagggctcggagga 
aatatttaccagagagaatgtggtgattcatttatcagtccagagatcgcaagtataaaa 
cttcaagatataagaaggatcaaattatatcatgtatgtgattcaatttaaaatgtctta 
gccctcttacattatattatctggattataactgtaaaaaaaatcaaattacattcatat 

40 gaaacttttatcaaaagaaatcaaatccatttttatgaaactttatagtacaattatttt 
tagttggtctttccttaggtcacagtatttataattccatttacatctgtataattttta 
aaattaaaaaacaaaagcaaatcaatagaaatctaagttttcttttgtaaaactctcttc 
agtctccaggccggcaccacatgacagtgttgacttgtcctccagacatggacaactccc 
aggatccctggcttacgaaccattcaggcctcgactcattaggaatgctttttggtttgg 

45 ctcacgttgcaagaaattctggagcatgtccatgccgtccagggaccccccaggcttcag 
gattaagtttctgtatttcattccaacctctggattcatgatcccctcttttttaaaaca 
gctgtgaaacatgtccatggaaaacacttcactccaaagatatccataatattggccatc 
ataccctcctgccaagtgtccaaaagtagctggcatatttgtgcctggcgtagctgcaac 
tcccagaatttctgtgcagtatttagcgtattcgctcgcggcatccagagtcgcattggt 

50 atggagagattggtcaactttgctcaaaacaatttggcgcagcgtcagaagacctgtgtt 
gaccagcctagaagcaacaagcttctcgagcagctcgtctgtgatagggtgtccatcttt 
ataatgctttgacagttttcgcagggaatcaacgtcccacacccagttttcaagcatttg 
tgatggcacctctacaaagtcagtttccacgtttgttccactgaatcgtgcaaagtcagt 
ctgcgcacagatctgatgcatgacgtgaccgaactcgtggaagtaagtccgcacttcatc 

55 atgtctcaggagagagggccgacctgctacaggctgagagaagttgaccaccagggcggc 
cacagacatcatccgactgccatcagggagaaggcagcctggctggagaccgaagcaggc 
tgcatggttgtattttccttcccttggatagaggtccaggtagaactgccccaggacctc 
tcctgtagctttatccttcacagtgtaaagtgaaacgctcttattccaaacatgagcatc 
gggcacttgttcaaatgaaagtcccagcagctcctggtagatgcttagcaagccttccgt 

60 gaccacctcaatggggaagtactccttaagggactcctggtccaccgagtacttgagctc 
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ctctgtctgtgtcatgtagtaatggaggtcccatgcattgatcttcccgtcgtattcaaa 
acctcgctcttcacattccttcttcttcaggctcaaaataaactcccgttctgcctcacc 
caagggtttcaatttctggcttaaatcatctagaaaggcggccacgcggctggtgctctt 
cgcagtgttcagttcaaggacaaagtcagcatgggtgttatagcccagcagcttggccac 
ttgagctcgcagcgggaggagctgttgcagaattgcggtgttttcctgtttgcacctggt 
atgaaaagccatttccatcttccttcgagtttcagggacacagcatttcttcatgacagg 
gaagtagtgaggatactttaaggtaactttgtacttgtcttcatctgttttttctaaact 
gtcaatgaagtcatcaggaagagcaccaagttcagccttggagaatacaagggaagtgtc 
gtcctcattgaggttcttgttgaagtcaatgcatagctcactcattctcttcttcattga 
tttgatttcatttcttatgtgttctgaaagatggagtccattcctttttcccattttaat 
tgacttttccaagtatcgcctggcttcaggctttatcttctccaaatcgcatgtttcttg 
taaatgaacaattctctgaaacacatcttctctcatgctcatctcaatatcaaaacgaga 
aagctttttgtctgcttctgtgcttgcagcccgcacttctctgtcagatgacacgtgctg 
agggaagtccagcatggtcctttccactatgtacgtcacttctatgtcagccagcacctg 
cagacagttctcataagttacttctttcagggcgattgtccccacggtgtcgtacacctg 
cttggtctgtgctatgagctgctctgtcctcgtcttgatctgctctggagaaaggtccca 
tctgagaacattcctgccagccgcagtgtaggaagacatagcttgaagaggagaagccag 
ctcctttcccagtgtcattgtcagctgaagcctggagccaccagctctgtggaggcctcg 
cagagtcgaaaggcacagggtgatcatgggcacgccgggaggccggcagcagctggcgcg 
tcgtcctcccgcttgtaggtgcaggaggcaggcggtggtgtctgcgggcccggaagccag 
gagtgggccaagccgaggagaccagatctcgagacggaggccgtcagtcc 

>scr_gb-u66707_2 (TOXMARKER Assignment: 47; SEQ ID NO: 47) 

tttttttttttttgtttatatgccaacatataccttgtgctagaaatactttatggggtt 

acaactctttatatacaattttttttgaggcagtatctctgatggagagcataacttgta 

aagagcttgtgtgtgcttccgtgctccaaaatgataggaaatccactttgagaagacaac 

ttatttgattttaaaaaaacaaaaacaaaaacaaaaacagaaacaaaaccgcaccaatgc 

acagccagaggctccgctggaactgatacagaaccgcgcaaacgccgtgattataagtaa 

cattttccagggtggtcaaggctaacgtacaatattatacacctggcactgatgtttgcc 

attggtcagcaactggcaaaatttgtttctatgtataaatttatttttaaacattatctc 

tggcctgacatatcttcactatttataaaaacatttagacagtgagctcacgttgaataa 

ctaggtctactgtgttctggaagctcttcagtagtaaaacagctttttcgtgttccatat 

gcacaaaactgtgtccatttgcctgaaggattttatccccgggctgtagaaggttggatg 

ctggtccatcaggctgaaccctagtaacaaagatacccttgtcggaaggtttgaaaggat 

ttccttgcccactaattccaccgctgatactaaatccaagcccagggttcttttctattc 

tcacacagaactgctcgggataaccgtccatactcctctgccctttcgtttgaattaagc 

accgtccaggctggggtccccgggtggcctgtgatgaggggatctggatgggcagtggcg 

actggaactgctgaatggtcactttgttgatgttcccttcatatggctgctgctcccggc 

tgcggtgctgaaggctctgtgaccccatcagggtctgaatgtggctgcctgcctttttag 

taatggtgtccggaggtacatcccgcctcccaagtgggtaaggattccattggccactag 

gagagacatcttcttgtccattgtctaaaatgttgctttgctgggacggggtcctgtcta 

accttctagcttctatatgtctaagcagctgctgtctccagtctgccggcattttgccac 

agctctcctctcccttcacaggggtgggccttgtctttatatcactgttatctgatgtct 

tgtcaccatagttacccaagttatagtcagatggtattttttccaggagggctgccatgg 

taggcctagctgaaactggcctggtttgggaggccccgtaactctctgtgctgtagctcc 

tggcagacagtggcctcctctgggtaaggtttttagcaggaaaacttcccgcctgcgctt 

tcacttcttgatatgaagggtgctcatcttcgtacctgccattcctcttgaggaattggg 

gatcagtcattgaaacgctgctctggccttccagccctcccctataggctgctcggccat 

acctgtcaccagggggcagctcgtggggctcactgacccttctgaacatggccatctctg 

tggagctggccagggagtcagccctccttaggaagcctgccctgggcccctggctggcga 

actgagcattcaccatggcatcctcatttacagatggttgggaaaaggagaacatctgct 

ccatgggtgggtatcctctataggctctggggctgacgagatccttggcgatgtttttac 

cgggctgttgtacatactcagaattgtgtgcaaaagggggtgggatcctcttctctgctt 

ttacttccaccgctccttgtggattgaagctttggtcaaactgatagaccttcttagtca 

tagatgctttttgttggggtgggcctttactacttccgtacatgagcatttcgtcatcca 

gcatgggtaccgactggctcctggacatactggacatgccgtgctctggtcccaagaact 

tgtctggccgctcgtggcttcctaagtgatcgcttccagaggcatagttttccagtggaa 

tgttatagaccttgtaggtaccgacgtcaatctcatcaatactctgggactttttgaact 

tgttggactttatatctttcatgagcggggaaagcctctccgtgcttttgctaattgcaa 

taacacctttagacgaatctgggcggcagtggatttgagaaaagacattaccgagactcc 

ggttgggggtggggtcgtggtactcccatggcactcccggagaaaagggacctggtgtct 
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ctgtaggctccttcatgtggtcttttctttcaggcaagggactggtagtaggggttgttt 
ctaatttggagggaaaagcagtcctgtcctcaaatggactgggggttctggtccaattct 
gccagggattggaaggaggcacttctgtttctggtgtgtgtctgtgggtggactgctcca 
gttccaggggaacaccgacaatcctttcttgcctgattaaaggcctgcgcccatgagcag 
5 gtacgcttctagccttggagcttaagagagggttattgttggcattctcgcctgtggctt 
cctcggagacaaaacctgtgttatcgtaatgggagccatcggtccagttgtcagggaaag 
catcactcattggcagccgatcaggacgctgggggaggttccctggagggacagcctccc 
gttgggtgagtaatggctttgcatctagaggctgtgggaaaggtggtgcaatcctgttac 
cccacaaagagttatgcacggcatctttagttgttggctgcaaggaccccactttcaccc 

10 gggtgttggaggatgctgaggaagcctgggagggcgagtagtctgagtaggtgcctgagg 
agacactgttattcagacagtgagttttgtcaacttcagactcatcagttgattcttttt 
tgtccttccctagcagaacaagtttgggtgggtacagaggggtctcagctaatgaagggt 
gaagttccccaatcctcatttcattagctggatgaacaaaagaatcctccactgtaatct 
cctttggggccaccggccacttgtgttcaaacttttctttcacagtttgctccgtgttag 

15 ctgttgggtttgcgttctcaacgcgcactccatggcttggcttacccaccagattttgaa 
cagattttaccatgttctttaaatcctccgggtaaggagttggatatcgttttaggttta 
tttcaacctgtctgccacttagagagggaagagtggtggattgggctgccactggtaatg 
gagcacacatgctcctctcctgctggaggccacttatacaaccccatgccagctgggggt 
cactctgggggacgggcatatcttggatctgctgatcacacctggcccatggtgtgcagc 

20 aatcgccagacagcctggcaggttggagagtaatcccacgctggcccctgtcccaggggg 
cttggcaggagagagccttaactttcccagcactttcgtcatcttcttttttatcctcaa 
attcaaaggcaacagtcatgcgctgttgtctctgctcctcccacagggtggggttgaagc 
tgtcgctgtctgactggaaatcttcatcaccacggggctgctggggaaacatgtagttgg 
tcagtaccctttgcttggtttctggatgggcttctgtttgcagagggatgagggccttgg 

25 actgattgtcagaaagccacaatgctgcaagctctttgagtttggtgaaggagaatggca 
agttcttcaacctattatcacttagatttaagactcgaagtctctgcatctgcccgattt 
cttcaggaagaaattctagcttattggagcgtagagacataacggtgacgttcttacagc 
ttccaatttctctgggcaactctgggaggaaattctcgtccacagctaaggttcgcaggc 
tgtgcaggtaaccaatggtgggagggagggactccagctcattgcaactgcagtcgaatt 

30 cttctaataaagataagtttccgattgtgttgggtagcattgtaagctgattgtcatcta 
cttttagagttgttaactttttcagcaatcctatagagtccggcagctgttgcaacatat 
tggatgatagtaagaggtcctcgagggcttcacatccagaaatatccatgtcaaccgttt 
ctatcctgttttttgacatatccaggtataccaacatctttaacttccctatagacccag 
gcagcacttgcaatgcgttgttatccatccacagctccctcaaattctgaatttgatcca 

35 gaacttcaggcagctcgctgaattcattattgcctaggtcaagtctttccagctgggcca 
gcttgtgcattgactttggtagagttttcaagtgattttctcttaactccaagattcgca 
atttgacaagtcttccaaaattagctggaagaaattcgaggaaggcgtcattcaggtaga 
gctgggtcaggttaagaagctgcgtgaagccatcgggtagtttagaaatgggattgacac 
tggcttcaataatggttaaacacttacagcactttatgttttctggaaattcttgtacac 

40 cgtttttactgatgtcgagttctttcagattaactaggctagcaatggaggtcggcagac 
ttgagaggtcattatcaggaatgcttagtttccttagagcttgacagttgaacaattgct 
tgggtagctcctcaatctgattggcatctagatagagctcttctagtgtacgttcgaagt 
tgaagacctccttgggtacctgttgcaggctgcagtgggagtaatccaacaccgagatga 
tctcttcctcgccacggaagcagcggcatggcaccaggcggccgatgagcttccgtttgg 

45 tggtcatctccaggcactgcattgctagtcactcctgtctctgaagacttctaggctgtg 
ggcactttgacttgcattcttttcatgtagcgggctcactcttcttcaggcctcttccga 
agtgctgcacgggcctccttacaaggacttctctgatattgtgggggattccttccccgt 
attaggttctccatcatcgcagaagca 

50 >scr_gb-af 0173 93_2 (TOXMARKER Assignment: 48; SEQIDNO: 48) 

ttttttttttttttttttttactagtaaggtatttactaggaaatgatacaaacagccag 
gaaaagggtgcatcgcagaacagggtctgtgcgtataagatgggtatttcccctttgtca 
cgtcattttttccatgaagatgcgcttaagataggaagggtaaagtaccgacacgtggca 
ggccccgggtttagggaagggaacgtgagagagacgtcaatggaggcccacaacagtgaa 

55 acccctggaagagggccagagcagtcccctggtgcagtactcagcgaatgcgcatacaca 
actggaaaggccttggcaaattgcccagccctgagctgagcggggtcaggtcgatgtcct 
caggctccaccagcggatgcaacgtgaagttctggagaatggaggtgaggtatatgaaca 
gctccatgcgtgccagtggctctcccagacacagtcggcgtcccgccgaaaatggcatga 
aggcggggctcttcttgaaggattgattggcatccagaaaatgctcaggattgaactcct 

60 gaggggtcttgaattggtcggagtcatagtgcacggtgttaaggagcgtgatgacatctg 
tgcccttgggtatcaggaagcccctgaaaggtgtgtcccgaatgacgcggtggggcaggt 
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tcatggggatgacgtctgcaaagcgctgcacttcgtggatcaccgcgtctgtgtaaggca 
tggatgcacggtcctccagcgtgggcatccgcgaacgtcccaccacacaatcaatctctt 
cctgcacacgggcttgcactttggggtacttcataagaatgaggaaggcatggcgtaaag 
tggtgcccacagtctccgttccaccaaagagcaggttgtgtgtggtcatcagnagggtgt 
ccatattgaagtggctcagtgggtcttgcttctcctgtaccatttttgtgaggaagcagt 
cgatgaagtcccggggagagttggggtccagggagtcctggtgctcgcggacgctgcggg 
cgatgagatctttcatgcccccaaagttccggaacacgcgtctgtgcggcccaggcaccc 
agtccaggagactcgggaagatgttgtacatctcgccccaggggctgctcataatctgga 
agttgtcattgataaagtggataatggtgagcagccgttcatcgtcataatcgaagcgac 
tgccgaagatgacagagcaaataatgttggagaccgagcggctcaggataaacacggggt 
caaagggcttgccttccgttttccgcagcacgtccagcaggaagctgccttcttccagga 
tccgctcctcgatgcttctttttcccatgccaaagttcctcaggatttggacagagaacc 
ttcggaggatcttccagcgttctccatcggagaaggcgatgccgttgcccttggtgaagt 
tgaaaaagatggggtatgagcctcggccactgaactcctcccctttgtccacaagagcct 
ccttcacagtttgatatccgctgaggacaatcacacgcctgggccccaggtacaccgtga 
acactgacccatagtccttgctaagcttggtgagtgaggtcagcaagtcttgggagcgaa 
gctgcagcaggtttcctaggattgggagaggcttgggtcctggagggagctggcccttgc 
cccatgaggtgaaggtcagggacagagagatgacagccaggaggagaagcaagatggctg 
tgctcacaccatccatagtgaaggcagc 

>scr_sc-134241980_l (TOXMARKER Assignment: 49; SEQ ID NO: 49) 

actatatgatcctgtttacatgaaccatacatactaggcaaacctgtagacatagaattc 

agaccttatacatagtccaatagcatagatcacagagcatggagacctgataaatgggga 

ctgaggctgttgggaagaagtgaggaatgactcagcaaccttgggcctggtctccagcag 

gtctcccagaatcagaaaaatggggccattttgaacagaagtgagtcggctgactgcctc 

agcacaatcagcgggctacaaagcaaatcttgtacactgagtctacaagcaacactctct 

gctatggattcctgctcatgctcaagtaccctcatgttgcagagaaagtccaaaaggaga 

ttgatca 

>scr_sc-191609675_l (TOXMARKER Assignment: 50; SEQ ID NO: 50) 

gccggctcaaaggtctctgcgagcgcattggtgttttcaatgacaatcttgcgtgccaag 

tcttctcccaaaaaggcgaattcatccagcatttcattggtggttctgaaatgcgctttt 

ggcagtggcgctggctgggcatcttctccgtgcccaatggtccggttgatcatagcccct 

tgaccgagactacggacaatgatctcccgatagatct 

>scr_gb-xl7037_2 (TOXMARKER Assignment: 51; SEQ ID NO: 51) 

gaacacagacaaggatgtatgtgtgggttcagcagcccacagcatttctgctcctgggac 

tctcacttggagttacagtgaagctcaactgtgttaaagatacctaccccagtggtcaca 

agtgctgtcgtgagtgccagccaggccatggtatggtgagccgctgtgatcacaccaggg 

acactgtatgtcatccatgtgagcctggcttctacaatgaggctgtcaattacgacacct 

gcaagcagtgtacacagtgcaaccaccgaagtggaagtgaactcaagcagaactgcacac 

ctactgaggatactgtctgccagtgtagaccaggcacccaaccccggcaagacagcagcc 

acaagcttggagttgactgtgttccctgcccccctggccacttttctccaggcagcaacc 

aagcctgcaagccctggaccaattgtaccttatctggaaagcagatccgccacccagcca 

gtaacagcttggacacagtctgtgaagacagaagcctcctggccacactgctctgggaga 

cccagcgcactacattcaggccaaccactgtcccgtccaccacagtctggcccaggactt 

ctcagttgccctctacacccaccttggtggctcctgagggccctgcatttgctgttatcc 

taggcctaggcctgggcttgctggctcccttgactgtcctgctagccttgtacctgctcc 

gaaaggcttggagatcgcccaacactcccaaaccttgttggggaaacagcttcaggaccc 

ctatccaggaggagcagaccgacacacactttactctagccaagatctgagcaataccac 

aggagtggattttatggggcacagacagcccatatcctgatgcctgcctgccagggccct 

ccacaccgttctaggcgctgggctggctgtgcactctcccatgtatgctgtgcatactac 

ctgcctggtggcactcctaataaacatgctcgcagctgtgagtctgtcactggccctaaa 

aaaaaaaaaaaaaa 

>scr_gb-bi291805_l (TOXMARKER Assignment: 52; SEQ ID NO: 52) 

ttttttttttttttttttccggggtcaagatatttactcgatgctttcaggtttgaattc 

aggggctcagcaagggggaggggcagggaagggacacacagggcatcttccaatcactgt 

gacttctggcaggtctcgatgtcttcattgccagtggtgactgatcagttgggacatggg 

gagaagtcctgtgccctccacgtctccattgaaatcttcttctgatatttatgcacatca 
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ttgctccggtccccgtcaaagtttccacaggccccacacaacatggccgcataatgctca 
tcaaccatcacattcagatgcccatcctttccaagccacacctggactccggccttctgg 
tggacaaacatggatccgtctgagatcttcctcacagacacagatgttaacacagtagct 
gggagatccaactcggagaccattcacccatgcacccttgcttgggatcacagtcaccat 
gccatcctggaagaagatgtggaccttgctcacgatcttgtcattgtt 

>scr_gb-aj 000696_5 (TOXMARKER Assignment: 53; SEQ ID NO: 53) 

ttttttttttttttttttgttggtttggtttgtttttggagacagggtttctctgtgtag 

tcctggctaccctggaactaactctgtagatgagactggcctctgactcaagagatctgc 

ctatttctgtgaggattcaaagtgttcatcgcaatgcccggcttagaaaatgagtcttga 

aatggcactcagaagggtggatgtggccttttgaacgggcaagtaacacaggtaaaatga 

aaacacaacaggtgcagaagcctgatcaacactcaccgcccagacacctttcaaacaagg 

agctaagtcaatgaggtagaaccccaaatcctccacctaggcgctgacaggcttaaagac 

cccattgccccacacagccctccctcctttgtaaggtcactgagggtacaggacctgggc 

agagacccagagcaaacagaaatgaaagaacaggctttgtaccctgaagagaggaacagg 

aggttttcaactcaaggtaactggatggcagcatttgccggcttcgagtgctgagtggac 

acacgtgcagaaatgacgtgagatgacacgcttagtaaaacgatgatacactttactcgc 

acaacctgaacctctactaaaacccagccagccacaagctgtttgctatcctttattaag 

aggtcccacattcttgcgggactccagccaaaccagacaggtcccctaaatatagcagga 

ggcctggaggggaagggaatgacttaggatcccaccacaccaccctggaaacagaactcc 

accacagacagacggacagacggacggacaagagccggggaggagaacccacctcactct 

tggttctctccccgttgcatccactcaaaaagaaagtcaaacactggctatgcagacccc 

agcccacccacccacccatagcagcgtttgtgggactcccccctgaaacgggtagcccca 

agacaacttcctatggttcttccctgactttggtttgctcctggcaactccgcgccctct 

tccttccctcagcctccagctctctctcagcatcttctaccacctactcggaccttccct 

ctctcttgctctctgctttctggtctccctgccacgggcttcttggggaagcagcgggca 

cctttctcctagcaagggccccactaggccctgtctgcccagcgtgggactcacacagcc 

gccccactctctttgaggtcaggggctgagcgctgccttcgcattcgtggaggggtagtg 

tatggtgggtagcggggccctggccgctgggctgggtaaggttggggctgttggggataa 

gagttgtgcttctggggccgtaagtgctggggttctggctgtgtagaaccccctccccga 

gatcggctccctccatctagggaattcctgcgaggacggtggggccttcggggacttgga 

ggtctgcggttagggggaggagggggtgcagtgacctcttcagggggttggggacctggg 

gcctcaccactccccattcctggccaggactccctgtgctgagggttgctcttgaagggg 

aagcgcagcttgcaatcatgcgggggtacaaagcgagctgggggcctgcgtagtccccca 

ccccggcccccggagccctgcagtccctgcagccgcagctgctcctgcttgagccagcga 

aggcgaccgcgacggaaggcagggtcctcttccatcagtcttgacacacgctcccagctg 

gactggggtggtgaggagggccggacagctggtgagtggtcattggacactgcctcctct 

actgcctctgacccttcgggtggggcccaggtgaccaaaccagattcttcattatcatcc 

tcaagatcctgggtcaacgggataaccctctccatgcggagcatccggtcccgcagggcc 

tgcagctcccggtccttgctgctgttttgcagcttcacttcctgcagaattcctgtcagc 

ttgtcaatatgagcccggaggtcttctacctctgccccacgggctccttcctcaccacca 

cctccaccacctccacatccttcttcttcacccacagtatcccagacatccctggccaca 

gctctccaggcgtctccaggaccctcaggcttgccgtaggtccggcacagctccctcatc 

ttgagagcagccagagcttcaatctctgcccgtccgtgacggaagtcggccagggccacc 

tcatagcagatctccttcactgcttgcatcttcaggtcagccatggtggcccaccggggg 

tctttgccctggagccgccgtcgctgagggatctggtaaactcttcgaggggccctgcgc 

ttcccactgctgggcaggccacagcgcttgacgatggtctggactgtgttagggggcagc 

tcgtcccgcaaggaggaaatcagccgccagctttcttcacaagagcgcttgtcagagtct 

tccccactgtcagaatctgcatacagccgctgctgctccagcaaaaggtcagcctcttcc 

ttttctttccggtactgattctccaagtcttgtagcctcttctccatctctagcttgatg 

tctatgccttgctgctccagccagtccttctgagcaaagttccagtccacaggctcagag 

ggaggtcctgggggtggggggacccctcgctctcgttccagccgtgcttgctccgggtga 

ttgaagcggaacacatggttcttgcccattacaatcctgttgcctgacttcagcaccagc 

ggctccgtcacaagcttcccattgacatatgtctcagctccttcacaaggttccaatgtg 

accatcacttctccatcaggctgagggatgctgcggaagaggcagtgctgctcccggatg 

aactggccagtcaacttgatgtccacatctacctggccaaccctggtgacgccatctttg 

atgtggtagagaaggcattcagacatcaaggggtcctcattcagatttaccaggtgggga 

gtcttttttggagagaagacacccacagtacgccatcctcccggagagcccatctcagcc 

agcaatgcttctctctccatcctcagagcttctgtcttacggagcttctcctcccaagtc 

tcattcagctcagctataattttctctgtttcctgcagcctctccatggcctcctcaggc 

ccaatctggggctcagcactgggtgaaaatgacggctccagctcgccgttatgtggagga 
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ggagatgagggtgaagctggggcagggggagatgatgcagcaggcagaacacctccagga 
ctcccctcttccacctttagacctcctagagcagaggctgaaagcccctgagccatcagc 
agttcccgcaaccgggccacctcctcctgcagctcccggataagccgggcattggggtcc 
tcattgatgacagcattgcatcggatctgtttggtgcggtctgcgtacctgagagtgctg 
5 agtgtctcctcgtaattgatgtcagcgggactcagggctgcaatcattgctgtgcgtgag 
ttcccacccaaattctccttgagtagccaggtaagcacagagtctctgtaagggatgaag 
tccgacttccgcttctttgattgcaaatctgccagggctgagatcaccttccctagagta 
gtcagggacttattgatgtttgcaccttccttcagacgcatgccccgagcccctgaggag 
tcggcccgctcgctcccggcaaggttcaccaagctgatcttactgaccttttctgaatcc 

10 agtccagtaagctggtcatgggagcgctgggtaaagacgatagtaaagacagcgtgggag 
cggctgctggtttcgttcatgttggtggcagccacagttcttgccttatttccacagtcc 
atgaggtcagcaatgtctgcataggaagtcacagccaacttagacaggtcttgtacatat 
gggcccaggatggggtgctcccggacccgcagagagccccgactcttggggttcaagagg 
tctcgtactcgttcgcaatagatctccatatagctcacctccacagagtaggaaagttga 

15 gcactctggttcacattaactcgagagaagaggtcctcgcagagctgaggtacaatgccc 
tgctgccccggttcctgccgccccatcatggtgtaggacttgccagcccccgtctgaccg 
taagcaaagatgcacacgttgtagccttcaaaggcatgcagcagcatctcctctcctatg 
tctcgatacacctgctgttgagatgcaaactgtgggtcctccaccgaagtatgtgaccag 
taagaatagtcgaatgaagcttttaaaaacatcctgctctgtttgggattaatgatggag 

20 gtggtgttgccctgcatgctgaccacacacttggcatcctggctggtctcacgggcatta 
aagggccgaaccctcactgccactttcacggaggcaccagccatagcttcagaatctcct 
gccctcctcagctggtgtcctggccccagatcagcggggctgtatcagttctggctgcca 
ccggccctcgtatgggaagccccatcctacacttggggcctggccacaccagcaaggctc 
ctcgcggcagactcccggcagagagcaaagggacaatactttgctggcgagtagtgctat 

25 gaactctgcgctaccggtgtaagagacgcatcggggccagttcggggctgcccccgcccc 
teg 

>scr_gb-d79221_3 (TOXMARKER Assignment: 54; SEQ ID NO: 54) 
atgggaaaaaaagataacccagggtgtgagcattctcgtgccgaattcggcacgagcagc 

30 attcgggaaaggcaaacagtggctctgaagcggatgttgaatttcaatgtgcctcatgtt 
aaaaacagtcctggagaacccgtatggaaggtactcatctatgacagatttggccaagat 
atcatctctcctctgctgtctgtgaaggagctgagagacatgggcatcaccctgcatctc 
cttttgcactcagaccgagatccaattcgagatgttcctgcggtgtactttgtgatgcca 
accgaagaaaatattgacagactgtgccaggatcttcgaaatcagctctatgaatcctat 

35 tatttaaattttatttctgcgatttcaagaagtaaactggaagacattgcaaatgcagca 
ttggccgctaatgcagtcacacaggttgccaaggtttttgaccagtatctcaattttatt 
actttggaagaggacatgtttgtattatgtaatcaaaataaggaacttgtttcatatcgg 
gccattaataggccagatatcacagacacagagatggagactgttatggacactattgtt 
gacagcctcttctgcttttttgttacattaggtgctgttcccatcatccgatgctcaaga 

40 ggaacggcagcagaaatggtggcagtgaaactagataaaaaactgcgggagaatctaaga 
gatgcaagaaacagcctttttactggtgatccacttgggactggccagttcagcttccaa 
aggcccttattagtccttgtggacagaaacattgacttggcaacgcctctgcaccatacg 
tggacataccaagcgctggtacacgatgtcctggatttccacttaaacagagtaaatttg 
gaagaatctacaggagtggaaaattctccaactggtgctagaccaaagaggaaaaacaag 

45 aagtcttacgatttaactccagttgataaattttggcagaaacataaaggaagtccattc 
ccagaagtcgcagaatcagtccaacaagaactagaatcttacagagcacaagaagatgag 
gtcaaacgactgaagagcattatgggcctagaaggagaggacgaaggagccatcagcatg 
ctttctgataacactgctaagctcacatcagctgtcagttctttgccagaactccttgaa 
aaaaaaagacttatcgatctccatacaaatgtcgccactgctgttttagaacacataaag 

50 gcaagaaaactggatgtatattttgaatatgaagaaaaaataatgagcaagactactctg 
gataagtcccttctcgacgtcatatctgaccctgacgcagggactccggaagacaaaatg 
aggctgtttcttatctactacataagcgctcagcaggcaccatctgaggttgatttggag 
cagtataaaaaggctttaacagatgcaggatgcaaccttagccctttacagtatatcaaa 
cagtggaaggcttttgccaagatggcctcaactcctgccagctacggaaacactaccact 

55 aaaccaatgggtctcttgtcccgagtcatgaatacaggatcccagtttgtgatggaaggc 
gtcaagaacctggtattgaagcagcagaatctacctgttactcggattttagacaatctc 
atggagatgaagtcaaaccccgagactgatgattacagatattttgatcccaaaatgctg 
cggagcaatgacagctcagttcctaggaacaaaagtccattccaagaggccattgtcttt 
gtggtaggaggaggcaactatattgagtatcagaatcttgttgactacataaagggaaag 

60 caaggcaagcatattttgtatggctgcagtgagatttttaatgctacacagttcataaaa 
cagctgtcacagcttggacaaaagtaacacagaagagtcataatgggtgatcagtgtgga 
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cagatgtaaaaagccagacgtgtccttctccatagcagtgccctaacagtgcaacctgcg 
gaatcagtcatttttaaagaaattctatacttcatatactgtacaatgattaaaataata 
aaccatttcagaagtaaaaaaaaaaaaaaaaaaccc 



5 >m61937 (TOXMARKER Assignment: 55; SEQ ID NO: 55) 

ctcaggtttctcacactcctggtaatactgtaaaactttaccatggaccacagttccaag 
gactcctgaacacagtcttggagttaagcctgtgaacagcccacgcttcccatcgatgcg 
taacaagcgatggattccatatctctgcgtgtagcactaaatgatggtaacttcattcct 
gtactggggtttggaaccactgtgcctgagaaggttgctaaggatgaagttatcaaggct 

10 actaaaatagctatagataatggattccgccattttgactctgcttatttgtacgaagta 
gaagaggaagtgggccaagccattagaagcaagattgaagacggcactgtgaagagagaa 
gatatattctatacttcaaagctttggagcactttccatagaccagagctggtccgaact 
tgcttggaaaagacactgaaaagcactcaactggactatgtggatctttatattattcat 
ttcccaatggctttgcagcctggagatatatttttcccacgagatgagcatggaaaacta 

15 ttgtttgaaacagtggatatctgtgacacatgggaggccatggaaaagtgtaaggatgca 
ggattggccaagtctattggggtgtccaactttaactgcaggcagctggagaggattctg 
aataagccagggctcaaatacaagcctgtgtgcaaccaggtggaatgtcacctttatctc 
aaccagagcaaaatgctggactattgtaagtcaaaagacatcattctggtttcctactgc 
acgctgggaagttcacgagacaaaacatgggtggatcagaaaagtccagttctcctagat 

20 gatccagttctttgtgccatagcaaagaagtacaagcaaaccccagccctagttgccctt 
cgctaccagctgcagcgtggggttgtgcccctgatcaggagtttcaacgcgaagcggatc 
aaagagctaacacaggtttttgaattccagttggcttcagaggacatgaaagccctggat 
ggcttgaacagaaatttcagatacaacaatgcaaaatattttgatgaccatcccaatcat 
ccatttactgatgaatagtaacatggtggactttgtcagcatttctatcggaagatctgt 

25 ttatgcattgtgatttgaaagatatcttggatactggtgactgaatgcatcagaccactg 
tttctgttaattcacagtcagctggagcaatgtccacagtgctatgagggaagccatgtt 
tttgtcacactctgaaatggaacatcacgttgcttttccttgtgtttttaaatattcatt 
tattttgctttccatatatgaatattttccctacatgtatgtgtatctcatgaatgtcta 
tgtccatgcagggttgaagagtgttgcaggtcacttggaaccggagttacattgattatg 

30 gagttaccatgtgggtgctgggagccaaacctaggtcttctgtgagactagcaagtgcct 
ttgaatgctgagccatctcattaggtccaaccctaaagatccttgcctgccactatttct 
gtgatctcaatgttttgttttctcctgacttctgacaccaagctgatttgctagaagtct 
tgggcatgaagtgggtgttgaggacagttattgcaaagggatttctgggtgggagttgaa 
agaacgttcaacattcagggaattaattgttcgaggttattgattagtcaatattcccc 

35 

>cszr_2 2960293 5_ 183895355 (TOXMARKER Assignment: 5 6 ; SEQ ID NO : 5 6) 
gtgcacttgtccgaggcacctttgcagacacagccctgggcacatttggagcagcccacg 
gggcagcaggagcagcagctcttcttgcaggaggtgcatttgcagttcttgcagccgcag 
gagctggagcaggtgcaggagccgccggtggagcaggagcagttggggtccattccgaga 
40 tctggtgaatctggagcaacggtgtaagcgacaagaaggcagtttttttttttttttttt 
taaaataaacaggcttttattttccacctgctcggtacaaaacggggtttattaaactgg 
gtggaggtgtacggcaagactctgagttggtccgga 

>scr_gb-af 106944_3 (TOXMARKER Assignment: 57; SEQ ID NO: 57) 

45 ttttttttttttttttttccaaaacaaattcttttataagttgtcttgtcatgttttgtc 
acagcagaaagaaaagccactaagacacttgctaatcccccgttctgtttttttttctca 
aaaccccaagatatatatatatatatatatatttacactcattttacatatgcaaaaata 
gaaccagactcttctccctaaagacttccctgaaaaacctactcagaaccctgcaagtac 
ctgatttctgtttattgagcttctcttccagaatcaagggaataaagacaaaggtttatt 

50 tttcttcactccaatgcctccaggaccaacctggcatggttttcattccaggagctagca 
aaataagggatgaaagtttaggtatcttgcctgctaatttcagtttcctaagggtggaga 
cagctccgtgtaaatgcccagtaaacaggtacttgttgagctaagtcatcaaaggaggag 
cagtgccccagaataaattgacagttaatgatgtcaagtatcttaatgtttatttttatt 
ctttacatccagcacttgaagaaaagaaaatgacatagtgttttagaaacatagtccttc 

55 atgattataactcatcaataccttagaacacacaaggacactgtgagttaatgactacac 
taaaaaataatgggaaattcagcataattaacaaaaatccaagaggaaatttcaggacct 
tgatcagaagctttcactaagtgctggcactatatgctacttcatttcactaagtgctgg 
cgctatgtgctacttcatttcactaagtactggtgctatgtgctacttcactgtagacca 
agcttcagggcaggctaagaaatcttaaccctctgaagacatgatctaagaaatggggac 

60 caagcacttgtagagaattggtagccatcaagaagtccctagtaaggacagctatggaag 
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gagctggccacctttaacctgaacctgtcttaaaattacaaagcccatggagcagtactt 
ataaacacaagcatggtgaggttttgccattctataaataatcttcaggattccagctgg 
ggctctcttttggcatgagaagcttcaggtaaaccagcagacataggatgacctattatt 
gatggaccttctcaaagtactcttttgaagctgttggacttggcttgatcgtaggggact 
5 ctggtgtccagttgggtgggcagacttctccatgggtctccacaaactggaacgccttta 
ccaaacggagtggttcttccacacttcggcccaccggaaggtcattgacactcaggtgct 
tgatgacaccattagggtcaataatgaagagacctctgagcgcaatgccagcactttcca 
acagtactccgtagtctcgggatatctgcttagttaagtccgacaacagcgtgatgttca 
tgtggcccaaaccaccattctttcttggcgtgttgatccaggcaagatgactgaagtggg 

10 aatccacagaaaccgcaactacttcacagtttacgtcatgaaactcattggctttgtcac 
tgaaagcaacaatttctgtaggacacacaaatgtgaaatccaaagggtagaagaaaagca 
ccaagtatttccccttaaagtcgtcgagactcagctctttgaactctccattgacaacag 
cagtacctttaaaatggggcgcatgctgggtgacagcaggggtgtggaatgaagaactgg 
tgctaaaggcaaactttgcttggggacaggcagaccacagcatgtctgtcaagcaggttc 

15 ttctagaagcaacaggcctaagaactgttgaggcagaaatactccggaaaatagtgctcg 
caggccgagccaccgaggaccagagcaaccttcccgcagctgccgccatcttcagagaac 
gcaagagccacgatagc 

>scr_gb-mll794_3 (TOXMARKER Assignment: 58; SEQ ID NO: 58) 
20 atgcccacatttgtgaccagtacatgtttctgcccaccatgttcgagactatcaaagtcc 
agaggggtcatcaatccacttatcccaaatcaaggtgcaccaatcccatttcaacgcctc 
tgccagccccttatttccaatgaacacagacaaagctgggttaatcaagtcaagtttttt 
tattttattgtcagttacatgctttatagaaaaaagtgtggagaaccggtcagggttgta 
caaaaaaaaggctaggttcctacgttgttttatttacaccattgtgaggacgcccccact 
25 tcaggcgcagcagctgcacttgtccgaagcctctttgcagatgcagccctgggagcactt 
cgcacagcccacggggcagcaggaacagcagcttttcttgcaggaggtgcatttgcattg 
tttgcatttgcaggagccagcgcaggagcaggatccatctgtggcacaggagcagttggg 
gtccatggcgaatggaggcggcagttggagatcaacgagagatcgctgtagagttctagg 
agcgtgatggagagaagcacgcggagcgcgacctttatagcccagagtattgggtcgcgc 
30 gcaaaagctccgcccgggtggcggggcgccacctgccctcctccccactgcctgcacacg 
cccttcttctggctcaagggaaatgg 

>scr_gb-af 069306JL (TOXMARKER Assignment: 59; SEQ ID NO: 59) 
ttttttttttttttttttaggaaaagcgactgctttaatgaattagacaaaatttcacat 

35 gaaatcagaatcctataatccttcccttctgatcactaaaaaatgcaagattcattcgtt 
acaagccatgtgcgattcggacccctcgaaggcagtgcaggtctgcggtccagcctcagg 
tgctgcactatttcccattctcagcgctgaacattcgttctgtgagcatccgctccaact 
ttatggcatcagcagcaaacttgcggatcccatcagagagcttctccacagccatttggt 
cctcattgtgcagccaacggaaggccttctcgtccagatgtatcttctccaagtcactgg 

40 tctgggctgctttgacggaaagcgtgggtgccagcttgctgctgtccttgagcagctccc 
ccagaagcttgggtgagatggtgaggaaatcacagcctgccagcgctttgatctcacccg 
tgttacggaaggaagcacccatgacaatggtcttgtagccaaactttttgtagtagttgt 
agatttttgtgacactcttcaccccagggtcctcctggggttcgtaggatttcttgtctg 
tgtttgccacatgccagtcaaggatgcgccccacaaagggagagatgagcgtcacgcccg 

45 cttcagcgcaggccacggcctgggcgaaggagaaaagcagtgtcatgttgcagtggatgc 
catgctgctcctccagctcctttccggcctggattccctcccaggttgatgataacttga 
tgagaattctgtccttgctgatcccagcttctttgtaaagctctatgatgcgcctggctc 
gggccaccatggcatccttatcaaaggaaagccttgcatcgacttctgtggatacacggc 
ctagaatcttctttagtatttctgccccaaacagcacaaaaagtttatcaatggcatttt 

50 taatctgctcctcttgtggcccacccagcttcttgccgtaggcaatggcctcctccacca 
gctcttggtaggcaggcatctgtgctgcagccaggatcagggatgggttggtggtggcat 
cctggggcttgtactcatcgatggcgttgaaatcacccgtgtcagccaccacggtggtga 
actgcttgagctggtccaaggcggactccatcctctggcgctttaccggggaccccgaca 
tggcgaaacgcgcacagctgaggcggtagctggt 

55 

>scr_gb-dl7310_4 (TOXMARKER Assignment: 60; SEQ ID NO: 60) 
gcactctccagcctctcaccgacttttttttcaaggagacaattttatttttttaccaag 
gctgaatttataccataacatgggtaacagagggaggggggaagtgtgaaacatttacac 
aggccaagggcacagtatacatgtagtcagctgatgtcaacaggatgttggtttttcaga 
60 aagcttacaggtcatcacattgggtatcttgatgtcagatgtatttctcagcaaggtcag 
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aactttatcatatcattattcatcctgaccaccagatttgtattagtcttctgcagctgg 
ctggggattttccatgaacccagtcatacttaattctaaccataacatcaataatggagg 
gtttcaagggcattgctcccaacatgtaattacaaaaagaaaaaagatgatatatttccc 
aaaaagagagacacattcaaatttcctctcaaactccccacatctgaatcatgatgatgc 
ttttaaattggttctcttcttaccaacattccaaccttcccacaagaacttgctctccag 
gttcttggagctctggttcttgggctgttggagagaaccctgggtctcttggtcactcct 
gccacaggtgccctacctcaaaactaagaaaaagggaaaatctatggagtactttcttct 
tcctcaaagaatatggggaatattgactaatcaataacctcgaacaattaattccctgaa 
tgttgaacgttctttcaactcccacccagaaatccctttgcaataactgtcctcaacacc 
cacttcatgcccaagacttctagcaaatcagcttggtgtcagaagtcaggagaaaacaaa 
acattgagatcacagaaatagtggcaggcaaggatctttagggttggacctaatgagatg 
gctcagcattcaaaggcacttgctagtctcacagaagacctaggtttggctcccagcacc 
cacatggtaactccataatcaatgtaactccggttccaagtgacctgcaacactcttcaa 
ccctgcatggacatagacattcatgagatacacatacatgtagggaaaatattcatatat 
ggaaagcaaaataaatgaatatttaaaaacacaaggaaaagcaacgtgatgttccatttc 
agagtgtgacaaaaacatggcttccctcatagcactgtggacattgctccagctgactgt 
gaattaacagaaacagtggtctgatgcattcagtcaccagtatccaagatatctttcaaa 
tcacaatgcataaacagatcttccgatagaaatgctgacaaagtccaccatgttactatt 
catcagtaaatggatgattgggatggtcatcaaaatattttgcattgttgtatctgaaat 
ttctgttcaagccatccagggctttcatgtcctctgaagccaactggaattcaaaaacct 
gtgttagctctttgatccgcttcgcgttgaaactcctgatcaggggcacaaccccacgct 
gcagctggtagcgaagggcaactagggctggggtttgcttgtacttctttgctatggcac 
aaagaactggatcatctaggagaactggacttttctgatccacccatgttttgtctcgtg 
aacttcccagcgtgcagtaggaaaccagaatgatgtcttttgacttacaatagtccagca 
ttttgctctggttgagataaaggtgacattccacctggttgcacacaggcttgtatttga 
gccctggcttattcagaatcttctccagctgcctgcggttaaagttggacaccccgatgg 
acttggccaatcctgcatccttacacttctccatggcctcccatgtgtcacagatatcca 
ctgtttcaaacaatagttttccatgctcatctcgtgggaaaaatatatctccaggctgca 
aagccattgggaaatgaataatataaagatccacatagtccagttgagtgcttttcagtg 
tcttttccaagcaagttcggaccagctctggtctatggaaagtgctccaaagctttgaag 
tatagaatatatcttctctcttcacagtgccgtcttcaatcttgcttctaatggcttggc 
ccacttcctcttctacttcgtacaaataagcagagtcaaaatggcggaatccattatcta 
tagctattttagtagccttgataacttcatccttagcaaccttctcaggcacagtggttc 
caaaccccagtacaggaatgaagttaccatcatttagtgctacacgcagagatatggaat 
ccatcgcttgttactcatgcaaccaagcaggtcttgggtctggcgagggtcttctgactg 
ttctgagacagccctgtgtgaggaatgcactttcacagggttggaggtacttccaagacg 
ccataggaaccacacgtgggtcacagctatcagttcactgtgggcaagaaacctctttat 
ggccacctggtaacaaaaatttttctgtctgtgaattttttcttactatttaaa 

>scr_gb-bf 281368_2 (TOXMARKER Assignment: 61; SEQ ID NO: 61) 

ttttttttttttttttttcacacagggttgcttttatttccacatccaacttgagcagag 

gccctgccacaacctgaacagctgtgaggtgctgggtgcctccagagtttctggcacagt 

aagtgttgggtgtgcagacttcctgatggccacatgacactggcccacacaggaacagca 

agtccatgaatggaaatcccactgagctggaagtggaggctctggaaaccccatgggcag 

cagcaggagttaaaggagccaccaggaacactgcagtgaggctccaatgcagacagggct 

gataaaaacccaaacagggcattgtgagagcagaggctcgagtgtccccgctgaggaccc 

ggggctgaaggcacagagctgtgtcgggatggaagaaccctgggtgcactcgcagtccag 

agcacgaaagcacaggtgagaacccagcccgaggctctctgtgaagagtgtggccttgga 

tcttgggcacggcacagtgacacacagtgctgaggtcactcctgacttcccagaggaatg 

acctcttcagtgacaaaaaactcaatggtctcttcctcccagtcatccacgttgctgtcc 

agctcgtcagtgtccacccctccccgtagctctagacgctcgttcttctgcttcatatag 

agttcctgggccatttttcggtattgcctgaagtcctccatcatggtccgccttctttcc 

accagttcctttgaagctttggactggctcaagcgatccttctgctcaaagatcttagag 

tatttcttcagatcctttttaatttgctttatctgatcctgactgaggagtgttgggggc 

cttggtctccagagcagctggcagaagcggtccttgttgttcttctggagaagacgacct 

tggaaggtccacagccaataagcattgtccaccttatggctccaccacgacacagaggta 

accacatagcggccagttgggtcccattcgacgtcggaggccatgtagtgctctgcaatg 

ttcatgacggtgcagtctgaagtgtcgacaaacgccaaggcgccattcatgctcctcagc 

ctcgtg 

>scr_gb-u56407_3 (TOXMARKER Assignment: 62; SEQ ID NO: 62) 
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ccaaaccaacaaggcagccacaggccgtcggtgcctgccgccttccaccaggggcccgcc 
aagacaaccttccaccatggctttgaagagaatccacaaggaactgaacgacctggcgca 
ggatcccccagcacagtgttcagcaggtcctgtcggggaagatatgttccattggcaagc 
tacaatcatggggccaaatgacagtccctaccagggtggagcatttttcttgacaattga 
5 tttcccaacagagtaccccttcaaaccacctaaggttgaatttacaacaagaatttatca 
tccaaatgttaacagtaatggcagcatttgtcttgatattcttcggtcacagtggtctcc 
agcactaactatttcaaaagtacttttgtccatcagttctctgttgtgtgatcccaatcc 
cgatgatcccttagtgcctgagattgctcagatctacaaaacagatagagacaagtacaa 
cagaacagctcgggaatggactcagaagtatgccatgtgactaaagagattattggatcc 

10 tctgcgaataaaagctaggggaactctgaaagagaaagtccttttgattcccacttgact 
gtttgctgtgaacccacgatgtaccggcctcgtcctccctggtgcacggtcttcatctga 
tacagtactgttgcatgttgcacgcaccaaaaatactgtgtttctgtaccaacactgtct 
cctagcagacgagccttctccaggcataacctaggtgtgagattaaaagttttccttatt 
gacttaaatctggataacaaggtgtgagtgagggtggtgggtacaagatactgctcagaa 

15 ggggtaaaggtccccaacctataagacaatgagatggcttttcagtggaagccatttaca 
gctaaatgtttaaatgaatgaaaagctaggtgaagaacatgaatgttcctgtactcattt 
tattccaaaagacctagagcttaaatgaacattaaagccaaccagactaagccaacccac 
ctcctgtattttaaagtctaattggtcaacaaaaatagatcggcactatcggtccataaa 
gtgtgcctggctttgttcccaaatcctttatacacggatgactcaacctattttctttca 

20 cactttctctccatattctttggtttacttgcggtttctcagttgattcatcactaatag 
ctcttatttttattatattaactgcttaatctatttggatgtaaaggtagacattcaact 
tgatgaaaaaagcttgtgtatagagacctaattgctcctcttggagcttgtacagtcaag 
aatgatgcatctgtgtaataaaccaattattctagccattat 

25 >scr_gb-ai406674_l (TOXMARKER Assignment: 63; SEQIDNO: 63) 

tgtacactacccctcacaaaccacaagccgcagcaacatggatgcccagtctggagcagc 
aacagccaggatgacctggagccaggggggcttcggaacagatgtgcacccttcctgggt 
gatgttttcagctttgtgagaaaccttactatcagaggagatggctagcaatgttaccaa 
caagacagatcctcgatccatgaattcccgtgtattcattgggaatctcaacactctggt 

30 ggtcaagaagtctgatgtggaggccatcttttcaaagtatggcaaaattgtgggttgctc 
tgtgcataagggctttgcctttgtccagtatgttaatgaaagaaatgcccgagctgctgt 
agctggagaggatggcagaatgattgctggccaggttttagatattaacctggctgcaga 
gccaaaagtgaaccgaggaaaagcgggtgtgaaacgatctgcagcggagatgtacggttc 
ctcatttgacttggactatgactttcaacgcgattattatgacaggatgtacagttaccc 

35 agcacgtgttcctcctcctcctcccattgctcgagctgtggtgccttccaaacgccagcg 
tgtgtcggggaacacctcacgaaggggcaaaagtggattcaattcaaagagtggacaacg 
gggatcttcttccaaatctggaaagttgaaaggtgatgaccttcaggccattaaaaagga 
gctgactcagataaaacaaaaagtggattctctgctggaaagcctggaaaaaaaaaaaaa 
aaaaacccctcgtgcc 

40 

> s cr_gb -bf 2 9 0 6 7 8_2 (TOXMARKER Assignment: 64; SEQ ID NO: 64) 
ttttttttttttttttttgattttggccaaactttttatttagtattttgtagttgttta 
acacacacttaaatggtcttactcggggagggggaagggaggttcttgtanattcccaag 
gaaaggtcagaaaagcaaaatatggccagcatccatttgctttttttgaggggggggggt 

45 ttctgggtaaatagtacatgcctaggcatctgatctcagcttggtttgtttgtttgaata 
tatatatactgcgaacattgagatttcagttggaagacaccctgaaatcctcacacccca 
ccaaccctctctaatggctagcttgtctgcacaggcagggtgattcaactctcaatggag 
accaaaggacatctagatggctaaatgtttgtggaagatcttggggttgcttgcctcatt 
tgctgggaaaaatcaggaagtggccttcagggacacttttacttggaaaattacaacact 

50 agttacaagtcacgggttacacatctaacatttgcttgttgaaagcaactcataatagca 
aataaaattaaacatgtcttactttttccctcacaagaacataaaaattattaaggggaa 
caggaaattttaaaaaggtaacacaatttttcctttagtagtccttgggtagtttatgac 
agaaagtttccatttttttgtttgtttctttgaatggggattgttggtccctcgtg 

55 >scr_gb-bi288503_l (TOXMARKER Assignment: 65; SEQ ID NO: 65) 

tgtacagttgctagtttgaggctggtgttgatgttctgacaagagtggctcagccatggc 
tcagtagagtcctcttctggaagtttgagaaattctggcttacgggaaaaggtttttctt 
tcttttcaagatatgtccaacaaagtcctcttcggtcagtaatttctgcagtgacgcctt 
tcgtccgtcctgtcagcaaactccaatcgcaacttgggagtccagtcaataaagggttaa 

60 gcgcacacaagcgtggccaactagtaggtccgagaggttcaccggcaggcaccgtactta 



105 



atatgcagaggggtgggcttcacgcctccccgccgagcgctcccacggtcgaggagttgg 
tgggcaaggagatgaggtttaagtccaatgggttaaacccaaccccgagagggttaaaac 
tacccgatgacgctgccacggaggggccgaatccac 

>scr_gb-d86383_2 (TOXMARKER Assignment: 66; SEQ ID NO: 66) 

ttttttttttttttttttcacaccagatgacgaatgtatatgaaagttaattcattaaat 

taaaaaaaaaaatcaaacatttggggagggttttttttacaacgaataattctatacaca 

tgctatagacacggtttctataaaacacactatctacaatctacttacatttaattgtcc 

tgctatttctagttcatgtgagatcagtcacaagtgagtcagtttccctgcctgtagaga 

ctgcgtcatcccttaataccagggtcagaggcactggccgagcaaaacaagattgtaaga 

atcttatcaactatcttgcttatgagaacagacaccaggggccaagtgctctgaaccggc 

tttggagttaaggcagcaatgtaaggtgtcacgtaaaaaccaagtgtgctctttgaaagc 

attccatggatccccaaatgctggccccctttctaagtgcacctctgaagtcgagggaac 

agctacacatttgggaaaagtcattcgagaacagccgcccaaaacctttaaagttatagt 

ttaagcttcaggcaaaagttcaaattacttctcacaaatagaaagaattcactttttaaa 

aacgaagtcacatttagccactttatcaaaacaacttaacaccggtacggaaaacgtacg 

ctaaaccaaaagtatggtttcaatgcacgccgtgccaaatattttcaaaacgctagaaga 

atggtacttctttctctcagaatttcccagtttgtctgtagcagaacggtattctaaagt 

ccagtctctgaacatggtcacggccgatgactgtcatccagcattaaaatagcctttatc 

accctcgatgtccacttcctggtcggaatcctctgagatctctgattcagggtcttccgg 

agaggctggggagggtgaacactgagaactgtccaaagaggcacctttattctgttcact 

gggcaagtcttggccctggtcacaggaagtgtccaaactgtccaactcatcctttttatt 

gctttgaggattctcctgcttcagtcgtctccatttagctctgcgattctgaaaccaggt 

tttgacctgtctctcgctgagctgcaacatcttggccagacgctttctctcaggtgggga 

gaggtatttctgagtctcgaacttcttctccagctcgatggtctggtcgttggaaaacct 

cacttgaccgcctttccttttgtgcagaggtcgctgtaggaaggggttccagagcaaggg 

cttgcccagggggtcgtggcggagtagggcgtgcgtgtagtcgttcaccgtccgcgggaa 

cgggtacagagggcctccgaagccaccggggccataggcagcggccagcgcggcggcggg 

gtgatgcgagaaggcggggtggaccggcgtgggctcgtacaccggggtccggtaggagga 

cacgaggctggtgaaggaggagttgggggacggcagcgtgggagtgggcgtgggagcggc 

gggcccgcgacccaggatgtcgtcgatgtagaaaggcgtcgggtgagcgggctgcagcag 

cggcgtgggcgcgtacagcgggactccgacggcgggcgcagccgcgggccccgggtgcgg 

gaactgcatggctgctccgc 

>scr_sc-133366194_l (TOXMARKER Assignment: 67; SEQ ID NO: 67) 

gctagcatcttttttctgccacgaggtgcgttttattttcatcaatcatacaaatgattt 

tccatatcacagggcaagctgagtgcctgggtgtgttcacagtgtagcttgtcgcttgtg 

tctgtccatcttccccgtcagaatggggtctcagaaatgatgaggtgaggtggagaaatc 

ctcctaggcttgtaggaaattttactcctcttttcctgttgaatggtcttttggttggct 

ggtgttcttctcatgctctttggttttctccagtgtggctttattgaagcttgtgatttc 

ccccatggataacttgcctgccattttcttagaactcttggaatcttgctctgagctcat 

gctccaattg 

>cszr_230290139_182026368 (TOXMARKER Assignment: 68; SEQ ID NO: 68) 
aagctttggagctgctaggtgctacctatgtcgataagaaaagggatctgcttggagccc 
tgaagcattggagacgggcaatggaactccgccaccagggtggggactaccttcctaagc 
ctgaaccccagcaactggttctagcctatgactattccagggaggtgagcacgccccaag 
agttggaagccctcatcacagatcctgatgagatgcggatgcaggcactgctgatacggg 
a g a gg a tcc 

>scr_gb-ai013477_2 (TOXMARKER Assignment: 69; SEQ ID NO: 69) 

tttttttttttttttctaagaagctgttctcatctatgaaccagatggcatctaccccat 

ctgttggctgatcagtccgatctttatgccactcctgtgctttagtgagcacctggtgac 

agtcatgatggggggtgtctaggtcagggtccgggagcagggttgtagggtttagactcg 

taggggcagtctgggatcacaaggaacaagtgggatacccggcccacgccaaggtccacc 

gttcttcgggtagtccatgagtatcatttgttgtcagtagccccttgtactcaaggtctt 

ttgcttgacactagcccatttggacgtaggagcacagagtgttgggcccccgtattcaca 

caacaactgggcgggcttcccttctatctttttgcatagccagcactctaggaccaagag 

gcttgccttccaggctgctggagaggcccctcttgttcttcctggggcagtccctgaccc 

agtgtcctttttctttgcattaggcacactgatctttagccaggaattctcttctgttgc 
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caggtactgtcttcctaggttccctaactactgtggccagtatatgttcctctcttgtct 
tttatctctctttagctctctagcttcctcttctttttgtctcttttcttccctagcttc 
ctgctctttttaccttcttttctctttctctttgtttaaccttactttctctgtaactta 
tactaactctcagcaacttagcttaacccttcaaatttctgtaactttctcttcataccc 
5 tttccttatcttagccagattggtggggcattttccagcccctaggagacccaccctcgg 
agcctgggggcagacctggagcactccctaccttcaggggcattgaagtcaacagtcagg 
agccttccatccatgtctggaacattctttctggcctctagcaggattctgtctttcctc 
agtggtaaagaagatctgtaacagttactaacaagcatctcacgtgggatggtgagaaaa 
caagaagggaatctagaggagagaggtccactgaagaggacaaatagcatttagtcacac 

10 agctaaaccaggaggcctttttttggacaaaaaggccactgtaaatataagcacaagctt 
tgtctatgaaacagaaaggcgagcagagaggcagcctagctgttaccggctgtctctctg 
ggcttagattttcccttaaggagtacctacctcccttcagtgtcagcttggtggctttgc 
ctctcaagagaaccagcctccaaatgacactaggcttctagtaacaactaataacaaaag 
gatggagagatggttagaacctgggtgctagatactaagcagctgacaaaagaattgtaa 

15 ccagttcacctggggctttcaggactttagtaacagccctttaccaaactgtctcagtgg 
gctataggcccatggaaaagaaaacattaatcctgaccttgtccaccaccaaagcctgaa 
ttctaacctcgtgccg 

>scr_gb-m91235_3 (TOXMARKER Assignment: 70; SEQ ID NO: 70) 

20 aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaacagtctctctgcatcttcttctacagcta 
ttaggtgctgtccacttttctgcacagaccctgaaccacgcatcaacttattttctctgc 
aacttacaataactctctcagtgacttagctttacccttcaagtttctgtaactttctct 
tcatatcctttccttatcttagccagatccagattggcgggggattttccagcccctagg 
agacccaccctcggagcctgggggagacctggagcactccctaccttcaggggcattgaa 

25 gtcaacagtcaggagccttccatccatgtctggaacattctttctggcctctagcaggat 
ctgtctttcctcagtggtaaagaagatctgtaacagttactaacaagcatctcacgtggg 
atggtgagaaaacaagaagggaatctagaggagagaggtccactgaagaggacaaatagc 
atttagtcacacagctaaaccaggaggccttttttttggacaaaaaggccactgtaaata 
taagcacaagctttgtctatgaaacagaaaggcgagcagagaggcagcctagctgttacc 

30 ggctgtctctctggacttagattttcccttaaggagtacctacctcccttcagtgtcagc 
ttggtggctttgcctctcaagagaaccagcctccaaatgacactaggcttctagtaacaa 
ctaataacaaaaggatggagagatggttagaacctgggtgctagatactaagcagctgac 
aaaagaattgtaaccagttcacctggggctttcaggactttagtaacagccctttaccaa 
actgtctcagtgggctataggcccatggaaaagaaaacattaatcctgactggcaaaaca 

35 aagttcttcacagttgtagattctttgaaactattttaggggctctttttgtcccccaac 
ctggggcattttaaccataggggcaggaactggctgctgtggggataggaccaaaggcac 
tctccatgttaatgatgatcagtggagaaaagtaattttgatgttggagactactcctcc 
ttggataggacagcagataaggaggcttcttaagactcttaatgagcgctctcctacttg 
agcgaaattcctttcctgttctgttttcctatagccccactagctctccagcctttttag 

40 tcattcttccttgacgatttctaacacagcctgtcctttttttatagcctgttaacagca 
tttctgatcttttaggcagctatcgactaagtgccataccgggtgaaactccgcctttaa 
gattccttactcccaaggaaaatttaaatctttcccagttcatcacagctggctgcgagc 
ataagcacagaataaaacactatatgtttttgttttgtttttctttccttttttcactag 
gctggggcccgaacccagggccttgcgcttgctaggcaagagctctaccactgagctaaa 

45 tccccaacccccaaaacactatgttttaaaaattaactttggctatcaaccaacacactg 
ccactagagcggggtctctacaaaattaagtttcttactcactaagcgttaaggggacca 
agtaaaactcttcgacgaacaaagcaaacagtttcatgatttcaaacacagtcgtcggtc 
caagattttaaacacagtcgtcagtccaaattcaaacacgaaacaaaagtcaaaaagaca 
ctaacagacacaacacgtccagaaaaccacagtcaggtcacaaagaagacaaacaattcc 

50 aacagtcaaacaagtaacaagcagacgcgccgcgcagcttcggtaccaaactgaaaccaa 
aaaattcagacggagtcatcaagggtgcggatccctccgaaaacggacggaggtgccacg 
gatccggatctccctctcctccaaccacccttggaacgtcttccagggctgcgggggaga 
agtccgagctcgtcagctccttctctggcccgcccagatagtccccagatctgagcctat 
tgatcgatcgttcacaggacaagacaccctcgtgcc 

55 

>cszr_204152648_191521095 (TOXMARKER Assignment: 71; SEQ ID NO: 71) 

tcatgacctcattttaggaccaagagctgtgttggtttcttagattgttagctttttctc 

taga 

60 >cszr_204152792_191517979 (TOXMARKER Assignment: 72; SEQ ID NO: 72) 

tctagaaaacggaggctgtctggatgcagtagtcatttgctgcagaggttggggaagggg 
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aggccccatgtttctcctgtggaaagagggtgtggggctctgggaaaaggccactcttca 
aacattcatga 



>cszr_204229614_191891958 (TOXMARKER Assignment : 73; SEQ ID NO: 73) 
5 gctagccttatgccagcctgccactgtcaacatattctgttcccattggttacatgcttg 
atacatacactcttgtgtttttggctaattgagctttttaattctattgtaatattttca 
attg 

>CS zr__2 0 4 2 2 96 1 5_1 9 1 8 92 5 1 0 ( TOXMARKER Assignment: 74; SEQ ID NO: 74) 
10 caattgaaaatattacaatagaattaaaaatctcaattagccaaaaacacaagagtgtat 
gtatcaagcatgtaaccaatgggaacagaatatgttgacagtggcaggctggcataaggc 
tagc 

>scr_gb-aa801331_l (TOXMARKER Assignment: 75; SEQ ID NO: 75) 

15 tttttttttttttttatgaagacacgaaatgcatttattcacataacaaaaaacaaaaac 
aaaaacgaaaaaaacactcactccctcttcacttgaaatgtgtcagtaatgactcaaagt 
gtcatgatttaccaggtggtgaattcttctgacaaccaggtgaagaattaggaaaacata 
cagttccagtctttatattctgaccctagaaatcggttcatttgtagctcttgggggtac 
acagtaaagcaggcaagcaactgtccacactgtttcattccacatacttagtgagtgccc 

20 ttattcagggcctaacttcactccaggcacaaaaacaaggcaggattgcctggtaagtct 
gaacatgagaaaagaaaacgatttattacacaacagatatatccatttatgtgagtgttg 
acatctaggaattctctgctttatagacaattagaagcagcatcctttctttagaatatt 
tctatgccctcactaaacccatgagtaagtatcttgcttgggagtcatacccagagctaa 
ttacaattcaatattctccctgtacatgcaatccttgaaaaacgttatatgtattttatc 

25 tcattttcataaaagaattacaaagaccccaaaaaggtttagtgtttgtttgcatattaa 
ggttgcaattctccagaaacccaaagttcggatagtatgtgacgttgtgcagacaatagt 
ttacctcatgctacaggctataaatgtcagaacagagcttaaacactcacattagtgaac 
gcattggcactacttgtactctttattttaagggctaagaaaaagcacacttctactcag 
ccctatggaagttatcagtgagcacattctctatcgctcactgtacagtaaactatgtac 

30 aacaggcactataacaaacagaattttagagtcaggtatgacatgaaactttttcaattt 
tttatatttacactgtgggtttatcctcatcttaagatcagtttttcattttgttttgtt 
cttctgtttttttggttttttttctgcctaaacggtatgctcaagtagcatggataaatc 
ttccagaatatgcactgagtaactccttggctcttcccagagccttgccttcagcacagc 
atgatgttaaaagatggtctcattgtagacatcaaagtaggtagaagaacaattgtgtct 

35 gtatcagaggctctatgaagagacctggagtctcgaagttccttcttactac 

>scr_gb-aa899865_3 (TOXMARKER Assignment: 76; SEQ ID NO: 76) 
aaattttggctggatccacagcaagagtcctcagtattatttatttgttttgttttgtgt 
tctgttttgtttttactgcaacttgacaataaaagatgtttggcattggaagagaataga 

40 acattaggtctgggcccagcgctctgactccgtcttgtttaatagtttaacctgaagtcg 
caagactgggataaacaggagagctgacatgaaggacatcatcgcacatgtttcggctta 
ctgtgtcagaactacacgtgcttggccttatttctttgagcctgtggcagaagagtgtat 
cgaggcagaaagcagaaaggtccaacctccctttctagaaaatgtcccttgatgtcctga 
ttctcttccactagtcctcactactaaaggtcctgtcacctctcagtaacactgtgggcc 

45 gggaaccaagcctcagggcacaggcctttgggagctgtattagagttctcgagaagaata 
aacagcacttgcagaaggaattcccagaagaagaatgacttacaggcttctgtccagcta 
atccaacagtgagcagaaagtccaaaaatccagcagttcgggccatgaggctgggtgtct 
cggctggtcttcagtagactctggaatcccaatgacgtaggctctaacgccagtgaagga 
atggacttgccaacaaggtgaggccaagcaggcaaagagcaaaagctcccttcgtcctgt 

50 cctcaagtagacttctagcagaaggcgtggcccagactagaggtgtgtcttcccacctca 
agatcaggattaaagaagatctactgacttcaaattaagcaaaactccctcccaggtgtg 
ccctctgtcattagattttagttcattcaagatggagtcaagttgacaaccaagaatggc 
catcaccggggacactccacatataaactgtataccaagcttcatatttcagacatgttt 
cttaatgtcatccacgtctccagcccctgtagtgtgtatgtgttgtattctctgcagaat 

55 ttagcatgcccgtgtttcctgtccttcacataaacgcctttgtgtgaagcttgcttgatc 
ctccactcccctctccagcccccacccctgtgacactgcccagtaataactgttcgttgt 
ttacctgttgcttgtaagtgcaagtattaaagcaatttgaaagctaaactcacctgtaag 
actataataaatacctgtaatccaataaaaaaaaaaaaaaa 

60 >scr_gb-aa997629_l (TOXMARKER Assignment: 77; SEQ ID NO: 77) 

tttttttttttttttttaaataataatgttactgtcgtgttggctgtatatcattgcata 
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tacttcaggaaaagttttcttgttcttgctaaataacaaagcacaattggtaagttccat 
ggacagcaggctccctcagaacgtagccagttctgtgaggcaccccatatcccaaggaca 
agcttgtggcatgccagatgaacagcagccttggcttacacgcacacctgtacataaaag 
ctcatctttccaaccacgtgcagccaagagattaccacagacttgacacagggaccctaa 
caggctcctatagacagtcctgccgctccatgaagtggggaaggaacaaatgcagtgacc 
gcatctaatgcacttcctttgaaaatgtttgcttat 

>scr_gb-aa997691_l (TOXMARKER Assignment: 78; SEQ ID NO: 78) 
agatctgagcggccgcccacggtcctgatgacagaagagctctcctccccgaaaggggca 
gtccggagcccaccagtggataccgccaggaaggagataaaggcagctgagcacaatggt 
gctccagaacgcacagaggagatgaggacaccggagcccctggaggagggtctagcagag 
gaagctggcagggctgagcgcagtgacagcaggggcagcccacagggtggccggcgctat 
gtgcaggtgatgggcagcgggctgctggcggagatgaaggctaaacaggagcggagagca 
gcatgtgcgcagaagaagcttggcaacgatgtcatctcccaggatccctccagcccagtc 
atgagcaacacagagcgattagatggaggggcaacagtgcctaaactgcaaccaggtctt 
ccagaggcccgctttggtttgggaacaccagaaaagaatgccaaagctgaacc 

>scr__gb-ai411514_4 (TOXMARKER Assignment: 79; SEQ ID NO: 79) 
ttttttttttttttgtttgtgaaagtacagaaaactttattggaaatctcttgattatat 
ttccaagtgtagctctcatttcctaacaaagcactggaggaggggcttcacagccacctg 
gtcccagcctgagcttggctgcgggagttgtctagagcccgtttcttccattgtgtagga 
ctgaggggcacaggccaccttgaaggatgcttcgctcagcttccctggcctctttcttaa 
gaatctgggacataaaggctgctgtctagaggccactggctgagccctgaaaagaatccg 
tgccctcaccccccttttagtgctggccctggggggtaaatcctgttcagtaggctatga 
atgtgcccctgacccaaaggctgcaatggcacttggccaccactgctgggcacatttctc 
tgtggcagcaaaagcatgcacaggggaaaggctccagtgttacatgcagattactaacag 
cagttgagagccacctgctccaatgcgtaacggctgctgccagtgaggatccagggacaa 
gaacaggacaggctggcagaggcacttgactgactcaagcaacaatacctgaaggtttaa 
gtcaaccataggctcagctttggtttctcaaaagggaaccaatccagcttgtaagcccag 
ggccatgtacagactctggaattagagggagggagagagggaggaacagctccctagtcc 
tgctccagctcaggggctggagcagcaggttatacagtgctcctctgggcaccatgggca 
acacacctctgaggagtcctcacactgaacacacctgagacctcctgggctgctagaaca 
gagctagtcacattacagatgctgtgtcaacagagtatgctcggcaggagcacgcagcat 
gccgggaagcctgatgcctgctcagttccatacacacagtttgagggggctactttgcct 
ttgccagacccattgctgatctctccttaggtgtgacaggaagatcctcagagcagtagc 
acaggttctgagtaatcttcaccggaggcctacagcccagagaaaccctcctccttcccc 
agcagaactgctaaccccaaacatacttcttttataaaatatctgatttctctgacagta 
ataaatatttaccatgttctatatccacgcagcagcgatcgagggaaaacgaggaggaaa 
aaagatcctacaggcggccgc 

>scr_gb-awl42560_3 (TOXMARKER Assignment: 80; SEQ ID NO: 80) 
ttttttttttttttttttgattgaaaatgtttaatttgtaaggcacacagtttatgatca 
ttttaatatctaaaagaacgaaattaacaggactaaaacctgattgtcgaatcatttacc 
aagtttggatgtcacgttgtaaaagcaggcttaaaaagatgactccttacaaaggagtga 
ggtggacctgggtgggacaggctagacatggccctgaaaaccttcttgggtgacaaagaa 
acagactactggactgaagccacagcttccaagaaacaagaaaatgtagtggccaccaca 
ttgggctttgtttccttatgagacattttccacctcatctcgggatcttactgttaccct 
tgcccaaactgcttatggcatgagggttccagagcccagcgccccagccaagtgtacaaa 
agacgtttcctgtagagtgtgcctgtgagggacaagcttgaggagtcctgtagagcgtcc 
agacaagctcacatttcctcattcatggatgatgaaggggatgtcacaagcagaccagaa 
actcctcaatgtctcaggaaaggaccgttttccagagcggcttacaagtgggactttctg 
ggtttccatctggagtttggttttcctgcttggcctcagactgagatagaagagcagtga 
gacagaaagtagacagagaatgagctagcctccgg 

>scr_gb-aw533305_2 (TOXMARKER Assignment: 81; SEQ ID NO: 81) 
tttttttttttttttttagaggttaaaggtgttcatttgccaaccggacagcctgagttg 
gatcccctgagcccccatggtggaaggaaaggattggctcctgtaagttgtccactcttc 
tgaagtatgtgcactgcggtgtgtacctgcccacatacacaaacaggctaggtagagaag 
aaaagggaaaccattaatagtcaacactgatacttatcaaaaatggcactagatggtgat 
ggtttaaaagcttcacttagaagccaacagtgacagcagagacagacctctgttaaccat 
tgcaggcaaaactgaaagacatgctcacacaggaagcaagcacaggcggctttgttgacg 
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gcttagctgaaacagactcaagacaaagcgtgttaacagacagacgcacttcacggtgac 
acgaggggccagctaccaagaagacattgaccccaaaacatgtatacacgccaacagaat 
cccaaaaggcacagtgagaaaggacagaaggaaagttcgaaatagaactttgtgctgccg 
aggtaggagattaacttcccctggagatttccacagtggccaaaacttcggtgaggatat 
5 ggaagacggaggtaccatctgagcttgatcagactctctaaggtgtgatattgcaaatag 
tgcaagccaaacgactcagcgggcacatcacaggttcaagaccagcctgagaaacttagc 
agggccctgtctcaaaattaaaagaggttgtttttaaggaccagcctcgtg 

>scr_gb-aw915573_2 (TOXMARKER Assignment: 82; SEQ ID NO: 82) 

10 cacgagggatcaccagatgctgccaggtgctggttgccaaggttgaaatgagaagtttct 
gttaactgggtacagagtttcagttttacaaggtgaagaagttgggcagacagatggtag 
ggatggtcacacaaagatatgaatgtatttactgccactgaagcaacactaaggtggtta 
atctgagaagttatgtttattatttaaaggactaaattgtcaagctaacttcaataactg 
ttttattttgtacaactgacatattcatatagatgacatctctaaagatgtctttatcag 

15 tatttaaaactgtgttacatctcttaggaatttgacacacagtttcacttgtaaggccag 
ataaccaattgtagggtgcgttattacccagaatgtggtgggtccaagagcttgaactca 
cgatcaagtttggtgacacttgcctttacccactgagccttctcatcaacccaagtttcc 
caggaattaagtaatctgtttccctaattccccttaagcaaacatggcagtcaagtgtac 
agcaggagacaggttatgatttgcatgatgtaatttaataatgtaaccatctttggggaa 

20 tctaattagtaccaaaagagaaaaaaaaaaccaacaggaaacagctgtctctctcacaca 
gtgttgagagctttccctcccactcattgccaatcagtgtcctggtgtcccctcaccctg 
cctctgtctctgcaacctgccagcctccaactgaacagacttccattcctgtgcaatcta 
agtcagtctctccagtctcttcctccctccctccctcgctctccctctctcttataaagg 
aaagaaagcactcactgggtataattgatgtctatatgcaggtgagggcaggtacaagat 

25 aaggcaagacctgtgattgggcagtgaaaaaagaaaggcgggggcagaggttttgtaagg 
caggagagatgaggaggtagaagaaccaagaaaaaggcagagaaggacgacccagatctg 
cgtggctttaaccgggcaaaggtagctatgaatatttcataagggacagatttatatagg 
acaatttgtcttacctaggtgggcagtttacatcaataccaattggttgtgactttattg 
tgtggacgttttgtggactgagaatttgctgatatgaatctgactgctaaattacaagct 

30 ttgggttttgattttaactggctactgggagttgtgactgtagccacaggggcagatgct 
gggattgtgagcagggttcacagcacagtcccaggatggcagctgctgctgggcccagag 
aggagccagtgccaacatggggctagccatggaggtggagagatcgctggggacagagaa 
gagcaggaggcagtgtggcttggtgcctggtgccccacccacccctgcatccattttaat 
tatttactgctacaactgggtgcttgcttttagtttcagagggttagtccattagcatcc 

35 tgaggagaagcatgcaggcaggcagacaggcatggtgctagaagggtagctgagagcttt 
aaatcgtgatccgcacgctgcagagagagaaaaaggaaacagagatggagggatgactgt 
ccctggcaaggactttcaaaccttaaaagccacctctagggacacacctcttccaacaag 
gccacacccctactccttcccaacagtccaccaactgtgaacaaagcatccaaatgtatg 
ggccgatggggccattcctattcaagccacctcactgaaggaataaattaacatgtccca 

40 aagtattaaatgtagtcatttttctcagtactgagacaaaatatctcaagaaataaaaaa 
acactgaaggacgtatttcgtttggctccccctttaaaagaaacagtccaccatggccgg 
gaaggcatgtggctggtcagagtgcaccctcatgcaggaagcagagagtgggggagtgct 
cctcgaagccttttcctttttatttagcatgcaccccaagcccacaggagctggctaacc 
cagcaagccttgctggcctggaagccacccccaacaaccatcatcaccccagtgcctccc 

45 tacagtggggattatgagttgccaccatgctgtttttcacatgggtgcaggggatttgaa 
accacagcctcctgcttgtacagaaagcatcctgaggagccatctctctggattcaccct 
tcacttttggctgactgggcctgagctggagtcacctgggct 

>scr_gb-bel08509_l (TOXMARKER Assignment: 83; SEQ ID NO: 83) 
50 tttttttttttttttttggtttgttttgttttgtttttctttgcttttctaaggatagtt 
taaaatacaaacaaattaaagtatgtgatatgtcaacatgatcatgcccctcccagacac 
agcctttaactgtccagctcaaataagagaaatgctgaagcttaagatgtctttgtcctc 
aggaagacatcacatgtgtggttgtcctgacactgcacatggcagcttccccacaacatg 
ggcccttcgccttcacactgacaagaagtgtatgcccttcacactgacaagaactgtgtg 
55 ctcactacaacttgtattggttgtaccttccccaaaagcagtaatgtatttctcaagatg 
tcctaaatcaagtggagactctcctctggaaggaactggactcagcctcgtgccgaattc 
tt 

>scr_gb-belll483_l (TOXMARKER Assignment: 84; SEQ ID NO: 84) 
60 tttttttttttttttttcagacaaggatggtttattgaatggaccccctgagactgatca 
atcagggccagggccgcagcctcagaattcaggggctgagccatgactctgaccatttct 
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cagggccggcttataaagggaaaaccccacaaagccacaatgagctcgcatgcaggtgct 
gccggatggttggctctgactcaagccatttcagacagaacagctcatatttacctttaa 
tgtggtgggccatatgtaaagctttgtgtaatttattaagttgaacaaacctcacagcat 
gaccttgctctgagtcgagtcattttctgtatcaatgatggcaggcatggaacaaaatgg 
5 ctatagctatgctaggtggggtagacctcaacaggataagaaactaaaaagtaacaaaga 
tgagaagacaattgggcatcctggt 

>scr_gb-bel20 910_l (TOXMARKER Assignment: 85; SEQ ID NO: 85) 
tcttttttttcggagctggggaccgaacccagggccttgcgcttcctaggcaagcgctct 

10 accactgagctaaatccccagccccaggaacaagccttcttaaacaaccaccccatctct 
ccagtccctgatcaatattttatgactacgtttactctgtaaaacaaaggattaaaatct 
aatccgattaccagtcttactagacaaaccttccaaatctgagttttctcaagtataaac 
acttcacaacaccttctgagaaatgtccacatcactcaaagacaacacatttgggaggtt 
tttatgggcttcttttcatacagaacctttcaaagcttgtaaaacttcgaacctagggac 

15 atttgggagttcttctcggtctcacacaaaacggacttgctttcaaagatcccttcggat 
tctatttgacttagcaaaaacacagcgcaaaacacacccctgtaagaacaaaggtgcaat 

tg 

>scr_gb-bf 285287_1 (TOXMARKER Assignment: 86; SEQ ID NO: 86) 
20 agttcatgtgcattggtgtttgctcacatgcatgtctgtgacgtatgcctgtaggagggc 
atcagatccctgcaactggagttattgacagttgtgagctaccatgtgggctgtgggaat 
taaacctcgaaaagcagccagtgctcttaaccactgagccatctttccagccacctcaac 
teat tcttaaatccacttaagacatagaggaaacactat tec ttctattctgtttgctga 
tatctgtaaaagtagacagacttgcagagtggtggtggcagacacctttaatgtcagcac 
25 tcaagaggcagaggaagacagatctgagttcaaggctaccctgatctacagatagagttt 
caggtcagtcagagctttatagagagacctgtctcaaaatacaaaaaacaaaaccaaatt 
aagtagacagactcccacttacacgaaacgtaaacactgtttcacacacttcagaatcac 
atttaaactaccaatcaacaagaactgacagaaccaatatcaggaaacctcatccatata 
aagcaacgtcacagcaccaagcagttaacagcttttggctcgctctaatcgaggatccca 
30 aacacaaatcttacacagacatggggaggtacatcctacatctcatctcggtcgcagctc 
atcgtcagtcctagggatcttttgggtccccacaaagatggaggcatagccttgctcttc 
ttgcccgacaaggaggccagcaggccaggaagttaaactgccaatacctgccaatgctgg 
tctc 

35 >scr_gb-bf390383_l (TOXMARKER Assignment: 87; SEQ ID NO: 87) 

tttttttttttttttttanagnagctgtattttctttatattctgcatgggatatgaaat 
aggggttttgctccacagggagcctggtcaatatagacaggatgtantcagggtgtgtct 
tccaaggtcatctccatttccaggcagatggaaaaaaaatcatgaacaatcatgttgatg 
attttgaaagatgagtataggcaatagcatgtgtcctctgtcctgagcaacagatctcag 

40 ggatgtgagggtgtgcgctttctggatggttcaccatacgcatcttcagcaccaaggcta 
tgcaagctttgttcagtaaggcagaacatcaggaactcaggagagtggctcccggaaggt 
gatcatgtggcttgacccctgattatccatcttcctcaccaatggtttgcttacattcga 
agcttaaagccttaaagttaacttcgtcttgtgatgctgttaaatgttttcaattacagc 
acgatc 

45 

>scr_gb-bf558463_2 (TOXMARKER Assignment: 88; SEQ ID NO: 88) 
ttttttttttttttttttaagagaaacattttaatatctgcaggctcacgcaggattcaa 
ctgtgtgtggtacagtctagagtgacttgcttctatttacttccacacacggtgactttc 
gatgagatggttaagctgagcagtatacattcctgaacagtgccaaggatcctgttttca 

50 aacagctttatcaatcgaaacatcctcaaagagccattggaggcagtgtggctgggccat 
ctgcactaaaatcgcttattcagaaggtgtcaaagcagccgagggccttgagccacaggt 
tgctggtgttcacatctcagctgggacgtgataaagactgcatgagctgcagatccgcaa 
acagccttgcaggctggctctgctcctgcaaagtcaatggagccacaaggtacttcttaa 
tggtgtcatctgttcaggttctccagggagttaagggaagcactgtctttgcacacagtc 

55 tctatcacaagggctctggctagcagcatgagagttccctctcagccaggctgccacagt 
gagccatctattgtcctcactgcagagtgcacaggatgaagatgtccactttcctcatca 
gacttgctgacagcctcatttcctgccaaacggatcagaccacactttcaaccctggtgg 
ctgcacatcttcctggacgataccagctcgatttacagcctgctccttctggtattcttc 
cagccgcagaaggggccggaagtagatagggtagaaggcggctccgaccatagagatgaa 

60 gcctccgaatatgagcgcggtgcgcaggttccgggccgcggccatggtgagaaagggggc 
tgcagggcgggcgaaggcccggcacgctccgaaacccgactcccagccttaaggtcgcga 



111 



cccggctcggaagaggcggag 



>scr_gb-bf560709_l (TOXMARKER Assignment: 89; SEQ ID NO: 89) 
tttttttttttttttttacatttatgaatttttaatactcctgtcaagatcttacaagga 
5 gaaattactttgggaggtgggtatggaggttagaggtaggttggaaagtggatcatgatc 
tcaaaatagtaaatgctagctgagtggctttcccagagagaagcgacatgccctgacgag 
actggagaacatgtgtaaaggagagcttattttcaggtctccgctggcctccatcctctt 
caaaaacctcagctcctgggttctgctcaacccacattctgtaatacttgctcaagtagg 
cctgtagcaccttgtaggatacagacagttccaaatggatgtccactccagtctctggct 
10 gctctattctgtacttctcttgaatcacagcttttatccatgtaagtagatgcctttacc 
tgggcacttgaagttcagaggagacaggtctttagatagaaatgtgcaaattacttatgt 
ggttattgacaatcaatgactgttctcccgtagtctcccctcgtg 

>scr_gb-bg662990_2 (TOXMARKER Assignment: 90; SEQ ID NO: 90) 

15 ttttttttttttttttttacataaactattttatttaaataaaaccaggactgaccctct 
cccacacgcaccagcacatgcactcgcacaatcatgtcctccgtttctgttcctcctgaa 
cagccacctcaaaccccacaggttttcattgtgaccatccttgaaacctgaaaattggga 
gatcccatgcgaaacactggcactcttcccccaaccctgggcaagcattctcctcatcct 
cctggtgggacaggagctcagctcttccaaggcacccagatctggtgtggtttcccttca 

20 cacaacccgggaacaccaatacccagagctgctctttgaggctgggacccctcgcttcag 
gtcaactcctctcacacaacagaggaggctttgtaaccatgcttaagcgctctccaaagg 
ttcctggcataggtaccgtctggtatgaggaagagcgacagagagcaattgagcaccaag 
ttccctaatgccaccctgaaggagggtgccaagctccagttcagtctgtaccaagaaaaa 
gcaagcctagcgccacacatggggaaggtggggatggcaaggtctcagccttgagaatct 

25 cacatctctaccctccagcatagatcccatgagggacccactagcaccttggcgattgta 
agggctcagcccaactggagacacaccacacaaacagtggccatttggagttggcccaaa 
tgcctgtgtcggtaacagggtttgactcccgcatctaacactgactgaaggacacacagc 
acagcagctaaggtcacgagaggtgcactgacagaaggtgttgtcttccagaggcacatg 
gacatttcacacactgctcacaggcaagctgggacaggagaagagcacaggctgccaggg 

30 actcagcagcgtatctagggcatgccctct 

>scr_gb-bi278552_l (TOXMARKER Assignment: 91; SEQ ID NO: 91) 
ttttttttttttttttttggggtttggtatcatttattttttttcttaaacccacttgta 
gtttgggttcagctgggaagcaggatatacgggtagagggaaggggacggtgcgagcagg 

35 attggcccatagctttgggggcaatctccaaaccctgctccagggaggtaggtcctgttg 
tcaggctcccagctggctcaggtgaggctccaaatggatcttctggagcaagtgtcctga 
gcagaggagaatttccattctctccaacccacctcctcaaagacccagtcagaaggtttt 
ccaacacagtgccaggcagttgaggggacatcaggccacgggcaggcctgagtgggtggg 
acaaggaacactgtctggcttctggttccaggtaacaacctaggatgtggctacccagag 

40 gctgccatctagagtgacctccgggagctgcttctcttgcttcctgggctgcctgggatc 
caaacctgcagctgccctggttgcaaccagtggtatacttcccaccccccacccctcaga 
caaaataaaataaaataaaatacaataaaaattagaataaataccaatcgggtcaacatt 
tacatttacacaaatggacaagatgatcccccaaaccgtagaagtttacagactggatgg 
gaaggatacgcagatgaagatggttttggggaggaagaggttcgccgtggtggttgatgg 

45 tggggggtcctggccctgtccaggggagggccagagccctgcaggaactgtggtctcaga 
gcttaggcaatacggccagttcatgaggagaacagtgacctgcaggccacttgagtagaa 
aacaaggaccaacttgtcctgacaggtaggggagcctaaaaaggctcaatatgagatcgc 
catggccagcaggacaccacagtttgggagaggcttcgcctcctgttcatccattcagag 
gcggctttgataggccgtccctctggcagcgggagagcctctggcctggggaggtcaggg 

50 tctgtgggtacctgcaacgcccctacttcccctcgtgccg 

>scr_gb-bi278749_l (TOXMARKER Assignment: 92; SEQ ID NO: 92) 
tggctcagtggtcgagcacagtaacaacatggagattctaaaaacagagaaagagaaaag 
caagaagatagtggagggagacaagaagaaaggctggggggccagttttgttatttttgt 

55 tttggttcagctatatgctccacacttccaaagcagcaaatgtgttgcatcaccacccaa 
acctgagaaagctacagcatcactggcaaggacaagctagcgcacgggtgacatcctcta 
accctgccattgtaaattatacaactgcagtttccagcacacaccattgcctccgacact 
attggagagcccgtgacactccaaaaactgctaaggcctttacagtatctgaccttcaat 
ggccccgaaaactggtaggccgcttctccccattccaacccaaaaattacatgcgagcaa 

60 cggaagagaaaagcttttaagcccgcgcggacgaagagaccagcggacgctgctgaagac 
cacagaccaggtaagccagctgaggctggagtttattgccgatgagcgctgagtcctggg 
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gaggagcggggaaggataaggtcgggcaggatcaggaccttggctaggagaggcggcgcc 
acgaaggcgaggccgggaggtgcagacagacaggcgcaggccacggtgggggcgggccag 
gctatccaggcactcggtgagcggtctccggcgtcgctcccggagctgggtggcggctgt 
ggcggcggctccgcggcagtcctggctgcggtcgtggcccaccggaggccccaagcaagc 
5 aggacgcggcgggaggcggggcgggtggtgctgctcgagcacacggagcagctgcagcgc 
tgggcaaggggtcggcggggcccgcaggcggccgcgtggggacccagatgagcccgtagt 
ataccgcaagcaacacagcagccaaggatacacacaggaagtaggcgcagacaggggcga 
gccgcagccatcgggcgcggggcccctcgctcagccccgtaccacctgggctctcgccac 
cactgcccacgcagctcgagcccccgcatgcgctgcccactcagcctgtaccgaccccgc 
10 ccccaccccgccgcttctagcaagccacgccccttctagagtcacgccctatcagaccgc 
cacccccctcgtgccgaa 

>scr_gb-bi295938_l (TOXMARKER Assignment: 93; SEQ ID NO: 93) 
aaaaatctcgatgccctcaactgttaggttaaagcctgacctgtgtcactatgtgctgtg 

15 acacgaacctaattcccaagtggacagggacacctgagtggcatttcgtgcttcagttcc 
ttccctcatgattcttgctgggtcctcttcactgaggctctcccctgagtcatatattta 
ctggaaaggctacctggagagcctttgaattgtgggcattcctttttaatgtgtccctct 
cttccacagatgaaacagcgcttttctcttgagtctctgtcatcctgtctcttccacttt 
tcggctggtgtcctgacaagtttctcccggcccaggtcaacagctgccctcattggcttg 

20 gctttggcagctgtgcacggtgcagccttgtcttcttttgctgacacttccttttctgtg 
tacttgttctgaatttctttgtcctctttgcttcttttttctttgctctctgtgtacctt 
tggtttggggtatcttcctggtctcgccgccgcctcacttttctcctcatgggacagtcc 
ttcatgaagtggccaattttcccacagatccggcagcacctgtcatttggggccagttct 
ccctcagtcaggacatccggatcaaagaagtatgccaggatgtcctttggaaatcctttg 

25 actggaattccaaatactcttctaccattgataaaagctttcattataaaatttgtcatt 
ttccttgataatccagcaccaagattgtggttcaaatcaaagggatcttcaatgacgatg 
tattttgaggtccactgtttcttaaaagttgtaagcagactttttcttctgatgctgatt 
acgtgttccttaaagtcaaactcctcagtgtagaagcgtagaagtcccaaccacagctgc 
ccaacagattctgtattttttccatattctggccaacaagtgggcagttcatttatttga 

30 tcgaaaaagtagatattccagccatcaacaagtatttctggtttcttttcacctttgtat 
atctcctgaagcacagggatgacagggggggaccgctgctggaggaagtacagcaccata 
agagtgtaagcgtatgatgacaagctgcctctggacgcgtcaccgatgtcacacatcttt 
gtgaacactttcatggtgtagcacaggtatttcactctggggtcaatggctgagtatgca 
aacaggagccgcgtgttgtgaagagccagtgtgtcctcgtg 

35 

>scr_gb-bi296376_l (TOXMARKER Assignment: 94; SEQ ID NO: 94) 
ttttttttttttttttttccaggagtcccttcggtccctgatagcgggagcctggacctc 
tgaggccgagagggtgctgtgtccccggcctccgagccgaggtggcccggctagggggcg 
ccacggagtttttttttttttttctttttcttttccaggagtcccttcggtcccagccag 

40 cgggaccatagacacttttgaggccgagagggtgctgtgtccccggcctccgagccgagg 
tggcccggctaggtggcgccacggattttttttttcttttccaggagtcccttcggtccc 
tgatagcgggagcctggacctctgaggccgagagggtgctgtgtccccggcctccgagcc 
gaggtggcccggctagggggcgcctccgaggctttattttttccaggatcctccccggtc 
cctgccagcgggagcatggacttctgaggccgaggggaagctgtgttccaggctatctac 

45 catggcctcctcggtctgtgagcactcagggttctaaggtcgaccagttgttcctttgcg 
gtccggttctctttctacatggggacctcttggggacacgtcaccgaacatgacttccag 
acgttccgtgtggcctgtcatgtttatccctgtgtcttttacacttttcatctttgctat 
ctgtccttattgtacctggagatatatgctgacacgctgtccttttgactctttttgtca 
ttaaaggacgttggaagaggcttgcaccaaggctgtttgcttgtccagccctagctcttt 

50 tcttctgcgcatgggcctcttcgatgcttgaagcttagcgtccccccatgagtacgcgct 
tcctgctttcccgtgcttgcttgcctgtgctctgtggggcagctttatgacaaccgtccc 
gcgtgtcaggcgttcccgatttccccgtggtggttgtcgtccgttaccggtaggagtcgt 
tggtgccgagtgcgactgaaagggttttcccgtttggtgctagtgaccccctggcgtgct 
cctctgcggccgaccggtttttttatttgttttttttttttttgttttttttttgttttt 

55 ttttttgttttttggaaggagttcccgaacctccgctgcttggtggtgtgtccctttctt 
tcctgctgtgtgcctcccgagttgcaccttttctccttcgaaggggattttattttttta 
tttttattttttttttatttttattttttttgaaggagttcccgaacctccgctgcccgt 
tgagtcccgttcttccacgccacgtgcctcccgagtgcaacgcttccttttttttctcgc 
cctcgagaagggtaaattttttttttgtgtgtgtgtgtggcagtgttagcgacttcttcc 

60 cgtgctctctctcgctcttctcgctcgtattcccgtccagtgcgtgttagaaagctctca 
cgcccgttgttcccgatgcatggcgtgtctcgctcccgttggatcgatgtggtgctgccg 
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cgttctcttcgggccggggcctaagccgcgccaggcgagggacggacattcatggcgaat 
ggtcattcagcgcgaatggcgaccgctcttctcgttctgccagcgggcccctcgtctctc 
ctccccattcctttgcagggtggtgtgtggaagtcaggggtgcggctgtccggcacgagc 
gctgacccgcgcacacttgctgctgtggttcgcggtgtccctgtggacgtgtcgggggcg 
5 cttgcccccacgccgttcactgcttcgcggccctcttcccccgtgccgggggaaggtggt 
agacccgctgcggtgcatacccttcccgaatggtgtgtgcacgcgccctgctttgtgtga 
gccttgcggtgctcctggagcgttccgggctttgaccaccaaggtgcccgcttctgagtt 
ggcggtggcgcttcccgctccccggcgtgcctcctgtgctccatggtgcttgtgccttta 
cgctttcccttgtcctagttgccggctttctgcacggtgacagaaagggggggggtcgag 

10 gagttgagtgtgcggttaaaaggctccttccgttgggtgagcgcccaccccgtgcctatg 
tttttggtgccttcacccgcgggccctgcgcggttagggtggtgctgagcgatcgcggct 
ggccctttttaaagaccggactccctcaagtcaaggctcctcctttgtgtgcgccttgaa 
gaggcctggccctcggcggggacctgtcgcaggtccccccggtccgcgaatgctcaagaa 
gaccccggagaaagagacctttgccgataccgcagaccccccaccagctggcgcgtggtc 

15 cttcccgttctgtcccgcgcctgttgctcgtttcccgttgcgtgcacggagcccttggct 
gctcgtcggtgttgggttcgtcccgccctcagtgaggaatttgccttctctagctatctt 
cggaaagggctttacgatctccgaggggcttctcccggatggtcccctcggctgcccgcc 
ctgacctcagccttctgcgcgcagcgtttgctctctcgcctaccgcgacccgcgcctccc 
cgctccgagtacgaggagggatcacgcgggacggggctctgtcgacctgccgctgtgcgg 

20 agcttgtgggggagattgggtttctggtggcaggtggcggggaagggccgtgcac 

>scr_gb-bm384392_l (TOXMARKER Assignment: 95; SEQ ID NO: 95) 
ttttttttttttgtttttcaagttgcacattttaatttacaatgtttaccagtaaaaagg 
attagttacaaaaaggaaagctgtctgtacaaaataagggtttttttttttcacattcat 

25 aaagagaacccactgtgaattcttaccttgtgaagtcaatactcaaacagctcactttgg 
taaaactatcttggaaggactagtaatccaggcaagataataaaattatcagcttcccaa 
tcatgtccaggagaaagaattttctgaacattttccctgtacagaaaagctctctgtact 
tgcagatccttagaaaagccagtgctctcaggagacagcctggtaccaggacgaagcata 
atctcctgctcactcaaatggcaatccttcctgaatctgacagacacacatttatcatag 

30 cctcaggtcagcaggagaaccagatggttcaggatcagcctctctccactcaatagttta 
tcatataaattaaatatggagaggtacacatgagaaagggggagctctttttcaaactcc 
cacttcctaatataatacacatcacagttttaatgagcagagaagggtaagtcaccctgg 
tttgggcacatttcctcaagggaaaaaccaaagtatcaaaagccttcaaagcatactggc 
ccgtcccactgcagccagcagcctgattccagaatgaaagcatacagtagctgtaaagcc 

35 ctggagccttcagaaagctttatttagtgataagctgagctctgctggcaaaagcccacc 
tataaaaagggagcaggtctgattcacaaagtgtatacatgcatgacccaaggtaatgaa 
gaccttcaaatgcaaatgatcctaaagctattggaacctctaattacgagtgacccgttc 
agatgtgcctccattagccttaaaaactgaccaacacacatctgaagaggcacttccctt 
agcattaacataaacacttgaccagaaaaggcatggtccaaaaaacagttaactaaaaat 

40 ttagagtctaaacctctcttctccaccgactgaatgaacacacccgcaatgaggaccaaa 
cagaatcagtgcctccagggacgtgtgtctgtctggccatgtgatcaggaacctcctaac 
atagcacagcacagcacagctgctctgggcacacaaagccagttcaccccatgaagaaac 
acaagggattgtgattaaacccatcccctgtgtcaggagcaactccactatggttttgat 
cactcagctcagagggataggagtgcctagcaacaagtcctaatcctcgttactcccagt 

45 ccgggccctcactgactcagaggtgcctttgtgtataaatatgtgagaggcagcaaatgg 
cagcactgctgacaggctaatgcaggccccacagcggagaaagttcttcctctgctgctc 
caatcttctccctacagttacagtcctgccagtgatggccaaggaccatgtgtgagccag 
ctctttgtgaccaagctttggcaagtcagtaagtttgtcaaaggcaaaatccttctgtgg 
acaatgctagctgcagctctggggacgtgtgagagaggagagggtcctctgacgggattg 

50 gggacgtgtgagagaggagagggtcctctgagaggatttgactcatcagcccctcttgcc 
cagttcattaatcagaaggaaggggagaggagaagacagcagaacatgagtcagttgtga 
aatctgcacagctgacatttgctcttcacagcagaaaggacttgaatgagaatcatgaaa 
cttgaggaacacttgtattttccttcgggatttaaaaatgtgtcttgtaccaaaagacta 
cattcagtgtgggtcaggtccaagagcggcagcaagagctcggccattaagcgtgcccag 

55 cactgggaggagactgtcatctgcttagcatggctggtgagcaggccagggctgctcctc 
actggtctccaagtcggaagccctggccccagttgtgtctcccacctccgccattctgat 
cagcagctcgcctcatgcttgcagggggcacaccgaagcccgacacccctcctctcctgc 
tgggtagccagcggtacaaaaactgaggtgtggacagaaaattccttcctcccaaatcca 
ttgggtatctgaacatcaggaagaaataaagatgtccgacaaggtttccaatgagctcat 

60 tgatgaccgagcctccaatgatatagttgaatccgaggataacccaaggtaagtaacagg 
ccttaaatcgtgttccaaaccaaaatgatacaatcaggtctctgttcagctgggcccaga 
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cgtaaagtactgacatgattagaggaatcatcagcaactgcatatccatggctaagccag 
taataacaatgcagatccagttg 

>scr_gb-bm387477_l (TOXMARKER Assignment: 96; SEQ ID NO: 96) 
5 aaatttcaagaggtcagagtggggcttagattaagtaactaatgcacagcaaaacgctgt 
gagattaggtgtgaaggagctggctgccctcctgtctcttcccttctctatcccacagga 
gctacagagagagcacagcagccagacgctggccaaacagggaacactctttatgccaag 
tcgcaaagatgacaagcggcatgaggaggacccagggccctcctttgtgtggaaggacgg 
agaggttctgggagggctgggaagggtatgggaggatcctttgtgtgggaggattgagga 

10 aggcctgggcaggctgggaagggctaggaccgctctcctttgtgttagaggtctgggaaa 
gtctgggaggatcctcctttgtgtgggaggactgaggggctctgggagggctgggagggc 
cctcctttgcttcacagttttagatgttgttccatctgctctcggagtttgaatttctgg 
atctttcctgagacagtgagaggatagccttccacaaacacgatgtatcggggaatctta 
aaatgggaaatctttcctttgcagaaagctttgatctcctcctccgtggtggtctctccg 

15 cctcgtgcca 

>scr_gb-bm986259_l (TOXMARKER Assignment: 97; SEQ ID NO: 97) 
gtaacccacctccattctgttcttcggacgcttgcgccagtgggtcaattttattttctt 
tcaaaaataaaagtcgagtgcattcagagacggccttaaggcaatacgcctcatcttccc 

20 acagtaaagatggcgacgccgtgagtaagttacaagtaactccacttccgcaattttctt 
gagccctggtccaagatggcggacgaggccacccggcgggtcgtgtctgagatcccggtg 
ctgaagactaacgccggaccccgagatcgggaattgtgggtgcagcgactaaaggaggaa 
tatcagtcccttatccggtatgtcgaaaacaacaagaatgcggacaatgattggttccga 
ctggagtccaacaaggaagggacccggtggtttggaaaatgctggtacatccacgacttc 

25 ctcaaatacgagtttgacatcgagtttgaaattcctatcacatatcccactactgctcca 
gaaattgcagtccctgagctggatgggaaaacggcaaagatgtacaggggtggcaaaata 
tgtctaactgatcatttcaaacctttgtgggccaggaatgtgcccaagtttggactagct 
cacctcatggccctggggctgggtccttggctggcagtggaagtccctgatctgattcag 
aagggtgtgatccagcacaaagaaaaatgcaaccaatgaaggatgaagcttctgaggcag 

30 gacagagggactgttgctagactctgattctgtttcctcctttctcatgattccttcaag 
ggtcacctctggccattacaaagtagctggagggacaaataacaaaacccaacaaaaggg 
caaggtcacaaagttgctaaattaagctgtacagagaggtgaaagatttgggccttgaaa 
gaggcggtttgtatcccttctccaagcagagccctggaggcattttggagacctggggtg 
taactgacagcatatagctttttgatttctggagacaacctgtcaataaaagctgcttcc 

35 catggtgtgaaaaaaaaaaaaaaaaaa 

>scr_gb-s69874_5 (TOXMARKER Assignment: 98; SEQ ID NO: 98) 

tttgctatctgcacagcccatcgagggacctgaggtggcaaaccctggacagtgggtcag 

gcggcgctcacgtctggggtgacaggatgaagcgggctgtgggctgtgtggagcaccgtg 

40 cacccctagcacctttgggtttcttgtggagttctcgccccagacatcagtgcactggat 
tgcaaaaggcaattcatcttttattggatcaggagcgccatttggagtgtgccattatgg 
gaggctcgtagctgtctgtccctcgtgccgaattcggcacgagccccccttttttttttt 
tttttttttttttttttttttttttttttttgaattagcacaaacgcatttatttactaa 
ccaaaggaatgatcctgggtaaaccaacggtctgacatgggtttcgggtaaagtgtctat 

45 gatgaaaagtcatgaaaaataaaaccaaagaagtgaagcagtgtggttctgtacgacctg 
ctcattgaattgagcttattccctcagccagctgactgctgtccaggatgacgagttagc 
cagtcctcattgtaccttctcatagacccgagtacagatggcattgttcatgacgcactc 
caccaccatcttcccgtccttcagttttctcgttatcgtgctttctttcccttcccactt 
ctggtgctggaccagggcaccgtctgtgaaggtgcagaccgtctcagttttcctgccatc 

50 agctgtggtttcatcaaacttctctcccaaggtgcaagaaaacacggtcgtcttcaccgt 
gctctcagttttgacggtgaggttgttgccgtcgagggtaatgatgcagtctggtttggc 
catggcacccatcttcctaagagccagccctactcctagttccttcatgtagtcctcaaa 
cccgtggctttccaccagacgccacttcccttccaggtccttaaggctggccatggcgag 
cgggagagcacaaaagcagcaaggagacgcggtggcgggggcgctgagggaataagctca 

55 attcaatgagcaggtcgtacagaaccacactgcttcacttctttggttttatttttcatg 
acttttcatcatagacactttacccgaaacccatgtcagaccgttggtttacccaggatc 
attcctttggttagtaaataaatgcgtttgtgctaaaaaaaaaaaaaaaaa 

>scrj3C-119263563_l (TOXMARKER Assignment: 99; SEQ ID NO: 99) 
60 ncctagcagaacgcttgttaggagtctgtgggacaagatagcctctgataaaataaactc 
taaacatgaactccttcaagaaaaaggactggactccaccactgttcaataaagtcacag 
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cgagggatgctagaggcggtagacagaaattaagacattctagatacggggagtggccac 
ttggttgggccaccacttgccttagcataggtaccataggctaagcatggaaggcagtaa 
gggtggatgtcattttaatgagagcagcaaatttagtacatggtttatcaaataaaaggt 
aaaggagtccaagatcaatctgacaaatagatctatcagctgaattgtaatcttggggtg 
gaggggtcagaggtccggcaattg 

>scr_j3C-132556005_l (TOXMARKER Assignment: 100; SEQ ID NO: 100) 

caattgctgctctaggatagtcagagtgtgttctctgtctcctgggaaacagtggaccag 

gaatgaaagcttcaacctggtacccagattttagatgttttagggacaatcagtcaaatt 

tttgtgtgaatgtatgggtttatatgactataactgtgtaagacagagaaatggatgtac 

a 

>scr_sc-132570828_l (TOXMARKER Assignment: 101; SEQ ID NO: 101) 
ccatggacataactacctcctgattaagtccgttaattgagacctaatcagtctgttaga 
ttattgaaacaggtcctgttagcagactgcagggagaaaacacggtcatgaaccaaagag 
tgagtccgga 

>scr_sc-132947646_l (TOXMARKER Assignment: 102; SEQ ID NO: 102) 
aagcttcctccatttcccagtagtgccatacgctggcaaccataggatcc 

>scr_sc-133387221_l (TOXMARKER Assignment: 103; SEQ ID NO: 103) 

aagcttcaactgtctatttattcacagtcacactggctgagatgtcctacactgtgtcca 

gtgcaagtgctgacactggacattgatgtcttcttctgtatcttagaggaaaggtcggta 

gaggtagagcctggcttccggcttgtcatacatgacccctaagtgattatttctactgta 

ccttattctcagaggaattttatcatgaaaggggtccaggagtctccccacaaaccttag 

gaacaccaatctcagtcagacagggatgttttgaatgcacacctaaagtctgatca 

>scr_sc-133555783__l (TOXMARKER Assignment: 104; SEQ ID NO: 104) 

gctagccatttggtatttattagataacaagttagggaactcatgccttggaaaggtgtt 

gttggttgcttgtagttctttgtctggcacagggaagctacagctattatctcaataaaa 

tagctgtcccttggattttttttttttaaataattgcttattcgagccaacatctaaata 

aggtgcatgcattgtatttgcttgatacgtttgttgtgtctctttttcttcttctgtaag 

tttcttcccctccttatttttctttcctcgtattgtatttactggaaaaaccagatcgcg 

cgccctgcaggcttctgtaca 

>scrjsc-133678871_l (TOXMARKER Assignment: 105; SEQ ID NO: 105) 

agatctgaaagttaggcaaaatataagagcagccctctgaagaggggacctgccagctca 

cttgggactcaacattctactgtagagctagc 

>scr_sc-133725675_l (TOXMARKER Assignment: xl06; SEQ ID NO: 106) 

agatcttggggtttcaggcttgtttggcattcaattttaccttctgagcccaggagcgag 

aatcttgaactaaagagggcttgacagtgctagc 

>scr_sc-133955481_l (TOXMARKER Assignment: 107; SEQ ID NO: 107) 

caattgaacagtagtctgtaagtagtgcaacactgtaaaatgttctctttagttcagaga 

gaaaattcccaagcattattccaactgctgctaaaatagatgttataattatcagtttaa 

tgccagttccaaacccctaaataagcaaatattactgttattgccagcaacttcctgaaa 

ctacacaaattcagtgtatccctccctccctcttttcctttcagtcatgaagggagcaga 

tacaacccagggtccaagataggtaagtgatccttagatgattttagatagcaggtggtg 

caaacttttaatcccagcacttgggaggtaaacaggtggatcc 

>scr_sc-134521597_l (TOXMARKER Assignment: 108; SEQ ID NO: 108) 

nctaacaaagatggtttagagatccaggtcaccaatcctcttctcagacagacccatttc 

tggggtcaacagccattactgcatgtagagtaaagggaagtaagacagagagagttcatg 

ggcagtcctaactggctgtgtggaaacagctttccaattgttctgggaatgaatgtagag 

tcagtgtccctgcatgggtcatgataagagtgcctgcaagtgaggcgctcacaagctt 

>scr_sc-172126480_l (TOXMARKER Assignment: 109; SEQ ID NO: 109) 
ctcaggttggccttaaactcactatatactcaaggatgaggttgaacctatcttcctatc 
tctgtctcctgagtgtactgggattgtacacatgtgccaccatacctggcttacgtgatg 
ttgtggatcaaacccatggctttatgtatgctaagcaagcactttatcaactcaaccaca 
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attcatctctatattttaaatgtaatattcctaatatgtctttacattttccagctacat 
tcctagg 



>scr_sc-172130231_l (TOXMARKER Assignment: 110; SEQ ID NO: 110) 
5 tgatcaagagtcccaaacccagagagtctggggtgctgacatctgaatgtggctggcctg 
ccctggctgactgctttcagtgccagccacactgatgccccttagccctctggggttaat 
ttaggaacttgggctcaggccaccgtcaccagcaatgaactcacaaagaatgagatgtgg 
ctgttgatttcctagg 

10 >scr_sc-172755010_l (TOXMARKER Assignment: 111; SEQ ID NO: 111) 

aga t c 1 1 ccggagcaa tgggg 1 1 cage 1 1 1 tgcagcgcc t ac tggacaegggagagae tg 
acctcatgctggcagccctgcgcacactggtcggcatttgctctgagcaccagtctcgga 
cagtggcgaccctgagtgtcctaggaactcggagagtcgtctccatcctgggtgtggaaa 
accaggctgtgtcgctggcagcctgccacctgctgcaggttatgtttgatgccctcaagg 

15 aaggtgtcaagaaaggcttccgaggcaaagaaggtgccattatcgtggatcctgcccggg 
agctgaaggttctcatcagtaacctcttggagcttctgactgagatgggggtctctggcc 
aaggccgggacaatgccctgaccctcctcattaaaatggtacctcggaagtcaccgaaag 
atcccaacaacagcctcacactctgggtcattgatca 

20 >scr_sc-i88295137_l (TOXMARKER Assignment: 112; SEQ ID NO: 112) 

gctagcttaagggttcttctgtaggccgcctcatttcctggtttaattttactttatgta 
tatgatgttgcctggatgtagatct 

>scr_sc-190079504_l (TOXMARKER Assignment: 113; SEQ ID NO: 113) 
25 agate ttttttgcttcccttccttttattgatccttaggaataaatcctcccaaactctg 

ttgtttttaaagttttttgaaagacctgattttttttccattttctttgcccttgcaaat 
aaccatcagtgtaattagttgtccatgctgcaagggaatactttgtgagggaaataagca 
agaattgagtgttgtttactaagaggtcacgcggatggtttttgggtaattatttactag 
t 

30 

>scr_sc-191455923_l (TOXMARKER Assignment: 114; SEQ ID NO: 114) 
tccggagctggggactgaacccagggccttgtgcttcctaggcaagcgctctaccactga 
gctaaatccccaaccccgtcaaaggccatttttatcctcatcaaacaattataccttact 
ttttgagttggaaatgtaattcagtaatagtctgttttcctagtatgtacaaagtcttgg 
35 gctccctcactaacaccaaaggaaaggggaaaaaagagctcacttctttgactttcagtg 
gccttccactcagactatgcttgtttagaacttcggcagcttttttcatgctctcctcca 
tcttgaactcaacaacactataaaaaagaaaagccaaaaacaaatgaataaaaccagtct 
tacttggaaaattgaacttggaaaattt 

40 >scr_sc-195460151_l (TOXMARKER Assignment: 115; SEQ ID NO: 115) 

tctagagaaatatacatagacagcaaggctggagttgagccaggcaacctaagctgggcc 
accggagtcaggcagctgcagaaggtcacgtgagcaggcccagtgctagcctgtgacgga 
gtgatgtagacactcagccacaccagggagccaatctccaagttgtcttggctagactgt 
ggactctgcccttcatgggtctgccacacaggcattctggaactgtctagctagctcttg 

45 gggaaacagctaaaaggactttggcttttctggggtttgcagggagggtaacagtgtctg 
cgcccttgttctctacttctgaatgtagtaacctcaccctctggggtagcatatgacagg 
tacccaactccttttcgtgggcaagectctggcaggggagctctttctgttgcaatgtaa 
cagaggcattgcctctttcaattg 

50 >scr_sc-198205946_l (TOXMARKER Assignment: 116; SEQ ID NO: 116) 

gtgcacagaagtatgtgttctgggtcggaggaaagatggtaggtgtttgtcccaacacag 
tgaaaaggaacagacatgtgaagtcttcagactgtgggcctttgatttacccctcagttg 
gtctatgtgtgtaca 

55 >scr_sc-2573087_l (TOXMARKER Assignment: 117; SEQ ID NO: 117) 

caattgcattgcaaaattttaaaggttacattgaaaacacttgaaaataagccaccaata 
aatgagatgacgataataagagcccctaaataaagaggctaagaaggagttaagtgtaaa 
ggaagagggaagaaatagttaaggcatttataagacactagaaagtctagaagagagaat 
gttagcagtacggagtcacagctaaaaatctgcatcttgccctttaaaacccaagagaga 

60 aagctt 



117 



>scr_sc-2585074__l { TOXMARKER Assignment: 118; SEQ ID NO: 118) 
agate tgctggtgtttgcctccacagtggtgaggttgcatgtacatgccgaccatgctcc 
tatctttcacatgagtgctgtggaatgctcaggtcttagtgcttgtacaagcaccttact 
caactgaaccattgtcttagcccaatagtgaaacactgaaaagttattttacccatgatc 
5 agaagctttaacaatcaactagt 

>scr_sc-8571871_2 (TOXMARKER Assignment: 119; SEQ ID NO: 119) 
cctaggtctgccagtgaataagaagacccctccccggaaagtcccgagtttatgttccat 
gcgctattcaatagccttcatcgcacatatctgcaacttcacattgatagcacagaattc 
10 catcataagcatcaccatggtagccatggtcaacaacacggaccagccatcccacctcaa 
tagctctactgaatggtttcctgatggtttaaacggtgatcaacatgaagctt 

>scr_sc-87731837_l (TOXMARKER Assignment: 120; SEQ ID NO: 120) 
tgtacacaggtagtcttaggatttctgttgctgaaaccgtgggaagggaacagttcaatg 
15 agtaaaaccaagacagaagtcaacctggttagaagctggaggcaggagaagatgcagagg 
ctgtggaggggtgctgcttactggcttgctccccatggcttattcctgctttcttataga 
acccaggaccaccggcccaagggttacaccatctgtggtgatctgggccctcctccatca 
accactaattaagaaagtgtccaagtttggctatatcttacagagatgttttctcaattg 

20 >scr_sc-87869413_l (TOXMARKER Assignment: 121; SEQ ID NO: 121) 

cctagggaatttgccattgtttagtttaagctaacactccaaaggtaatctcctatttcc 
tcttttcctttctgtcctccatgtggctgtcatgggcatgcagcataccagttctcaggt 
gcctggaacactggccagtgctctagcccagccactgtgccctgaaatccttccctgtgt 
tcaatgctacagcacatcctccagactgcctccccacccccagcaaccgaattgagcagg 

25 gacactaagacagtcctttggagacttccactggtctgttgaaactttggctgctctcac 
agcatagctcctcttagcctgtaacttagtgctgctcaggctgactgatca 

>scr_gb-ai233262_2 (TOXMARKER Assignment: 122; SEQ ID NO: 122) 
tttttttttttttttttaaggggccaagcagaagacaagctgcctttattatagttgatg 

30 tcacagctctgcttgtaatagattcagccccagaaacaccccggttaaaacagcacggtt 
gacttcaatggatagagtctttggtaaggtgaaccagaccagggctgaccgacaatcttc 
gggcccctggcccaggggtagcctgtagtcttacgtgaggcccagcatggcctgaagttc 
ccgagctttatcatctggcagagagcccagggctgtgtggaagctgtcgctgtgctgctt 
ggccaggaacgtcagtagtagtagcagtgcggccttggtgtctggggggatcctgttgtc 

35 tggcaggatcaggctgcagatgcgcaggagctctgaagccacacccacaacctggtcagg 
gttgttctggtgcaggaagctgaagaggtgacctatagtgacccattcctccatgtcttc 
cttcaggggcagggcatgtagcagggtagctagcacctggggctctgtttttcctgccgg 
actggccatcagcagacgggcaagagccccacagatgttatcacggactcgatcatgccg 
ctcccttgccaggaggggcaaaaggaggcccagtagcttagggaagtggtcctgagcagg 

40 gcagcccccatgctctgcaagtacgcccagcccaaagatggcattgctccgcacctcggg 
gtctgcttcccgggcattgtttaacagcacaggaaacagccgggacacaaattgggctga 
ggcagcacctagaccctgaatggattctgccagtgtccccactgcaaaggacttctctgc 
cactgtacagctctgtttcgtcttacacagcaataatggcaacctcgtg 

45 >cgrrs0h0310. 9JL3952 -135 (TOXMARKER Assignment: 123; SEQ ID NO: 123) 

tgatcaagggcgacacatctggagactataagaaggccctgctgctcctctgtggaggcg 
aggatgactgaggagctgcctggagtgccctgggcccgcctgctgcccaccatcagcttc 
cttcagcaccacgcctacttacgttcaatgcctgcctgcctgccacgctgccttactcac 
acgagtgtgtgctaatgaccaaagctgtctcgaatgaaagcagtgttctgctgttctgtc 

50 tgacatagaccttcccacgtctctcagtctagtatctctaagttgcgttttctatcctct 
tctaaagctt 

>scr_gb-ml3100.5_2 (TOXMARKER Assignment: 124; SEQ ID NO: 124) 
aagctctggttgcttgacattgttgtacatatagggtctcgagccccttagagctcgtcc 

55 agttctttctctgattccttcaacgggggtcctattctcagttcagtggtttgctgctgg 
cattcacctctgtatttgctgtattctggctgtgtctctcaggagagatctacatccggc 
tcctgttggtctgcacttctttgcttcatccatcttgtctaattgggtggctgtatatgt 
atgggccacatgtggggcaggctctgaatgggtgttccttctgcctctgttttaatcttt 
gcctctctcttccctgccaagggtattcttgttccccttttaaagaaggagtgaagcatt 

60 cacattttgatcatccgtcttgagtttcatttgttctgtgcatctagggtaattcaagca 
tttgggctaatagccacttatcaatgagtgcataccatgtatgtctttctgtgattgggt 
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tagctcactcaggatgatattttccagttccaaccatttgcctacgaatttcataaactc 
gttgtttttgatagctgagtaatattccattgtgtagatgtaccacattttctgtatcca 
ttcctctgttgaagggcatctgggttctttccagcttctggctattataaataaggctgc 
aatgaacatagtggagcacgtgtctcttttatatgttggggcatcttttgggtatatgcc 
5 caagagaggtatagctggatcctcaggcagttcaatgtccaattttctgaggaacctcca 
gactgatttccagaatggttgtaccagtttgcaatcccaccaacaatggaggagtgttcc 
tctttctccacatcctcgccagcatctgttgtcccctgagtttttgatcatagccattct 
cactggtgtgaggtgaaatctcacggttgttttgatttgcatttcccttatgactaaaga 
tgttgaacatttctttaggtgtttctcagccatttggcattcctcagctgtgaattcttt 

10 gtttagctctgaaccccattttttaatagggttatttgtttccctgcggtctaacttctt 
gagttctttgtatattttggatataaggcctctatctgttgtaggattggtaaagatatt 
ttcccaatctgttggttgccgttttgtcctaaccacagtgtcctttgccttacagaagct 
ttgcagttttatgagatcccatttgtcgattcttgatcttagagcataagccattggtgt 
tttgttcaggaaattttttccagtgcccatgtgttccagatgcttccctagtttttcttc 

15 tattagtttgagtgtgtctggtttgatgtggaggtccttgatccacttggacttaagctt 
tgtacagggtgataagcatggatcgatctgcattcttctacatgttgccctccagttgaa 
ccagcaccatttgctgaaaatgctatcttttttccattggatggttttggctcctttgtc 
aaaaatcaagtgaccataggtgtgtgggttcatttctgggtcttcagttctattccattg 
gtctatctgtctgtctctgtaccaatcaccatgcagtttttatcactattgctctgtaat 

20 actgcttgagttcagggatagtgattccccctgaagtccttttattgttgaggatagctt 
tagctatcctgggttttttgttattccagatgaatttgcaaattgttctgtct 

>scr_sc-170396977_l (TOXMARKER Assignment: 125; SEQ ID NO: 125) 
tgatcacgctcagcccttggtaggacattctacagagtctcttgctgcccctccgtctgt 
25 gccagtggtaccacacggggcagcctccgtggaagtttctagttcacagtatgcagctca 
gagtgaaagtgtggtgcatcaagactccagtgtccctggaatgccagtacaaactccagg 
cccagtccaaggacagaattacagtgtctgggattcaaaccaacagtctgtcagtgtaca 
gccccagtattctcctgcccaatctcaagcaaccatatattaccaaggacagacatgttc 
aactgtctacggtgtgacctctccttattcacagacaactcctccaattg 

30 

>scr_sc-14059147_2 (TOXMARKER Assignment: 126; SEQ ID NO: 126) 
gctagcatcgtgatggccaagtgcatccctgtgcttttttcttttctaagaaagattgaa 
aaccaacagttcttccccaacagctgcctaaattttaaggggtctgacccttacatttca 
attgggggaatgaagggggcccaaccggcttaattgctgtgggagagtgagtctggatgt 
35 ctgagagagcaccttgggagggactcttcctgcaatgctgtaaatacgagtaccgtttta 
ataaagcatgtaca 

>scr_sc-87750810_l (TOXMARKER Assignment: 127; SEQ ID NO: 127) 
tttttttttttttttttggctcctgccatcttttttattggtctgggctgtgggctgggg 

40 gaggcaggtgggctcacatctttatgcaagcagcaaggagacggttcacatgctcaggag 
actccaggaaggccttgagcttgggtcgggctttgagacgcgctacataggcggagagca 
gggggaagtctttcaagtaaccagggaacaggagctctaggttcagaagtaaatccagta 
ggcggtagtcggcgaaggagatctggtcaccaacaatgaagcattggccacccttgttct 
gggccagaagagtttcaaatggcttcaggtgtcctggaagctccttcctatattggccct 

45 tgtcctccttacagatatggagatagtgccatgcaatgcgcctgaacacgtcttccagtc 
cgtcgttcaccatgtccaccagtgctgcctcttgctggtctttgccgtagagcccgaagg 
agtggcccaggtgccgtaggatggcattcgattggtacagagtgagctttccatcctgga 
acttggggatctgcccaaacagacaggaagccttgaatgtgccttgctcccaaacatcca 
aggtcaccacctcctccttccaactctggccctggtcggctagcagcatgcgcataacct 

50 cacagcgcccagtgttggggtgcaggatggggatgaggccacagcgaagagacccaccct 
cagagcatcctgggagagtttgggagactggaaagctgacaagtggactaaactagcttg 
ggagcctcgaagggagggaaaaaatgtggtggtagaggccatgtcctaacattatcttgg 
caagccaagacccagccccaccggcacagggaaggaggaaaagtgacagacagtgtagct 
gcctatggaggctaagaggtcagtcctggccccaccaaccacaattgtagtcccgcccca 

55 agtctcggtcttgcccccaacgtggtcttggccacatccctccagcaccagtgttgaggg 
ggccccaggagtgactatggcttgtgcccttcatcttgaaaac 

>cszr_2 02034260_190929676 (TOXMARKER Assignment: 128; SEQ ID NO: 128) 
gtgcaccagtacctgatgctgggagatgaatggcttagcgctgttctacttggaacatat 
60 cactcctgccagccgggcactaacaattatcacccaatccaggacttaaactgtgataga 
ctggctgatgtttgcctttgaatagagtgtcccaaaagatgggaccactggtcagctgcc 
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atggactagattctccacctgttgggggcaatctggtcaccttgctgcccaatccgacct 
ggagccaccacagcacgagtgtcaagcactggcagaagcccatgggtggaggaaagacct 
ctgcgactggctgattgacccctgctgaaagccgaggctaccttgtccacagacgggaac 
agttctcttcatga 

5 

Table 9B 

>d90282 (TOXMARKER Assignment: 35; SEQ ID NO: 129) 
aagcaaccttaaaatgactgcaccctcccagatttcttttacattaactaaaaagtctta 

10 tcacacaatctcataaaatttatgtaatttcatttaattttagccacaaatcatcaaaat 
gacgaggattttgacagctttcaaagtggtgaggacactgaagactggttttggctttac 
caatgtgactgcacaccaaaaatggaaattttcaagacctggcatcaggctcctttctgt 
caaggcacagacagcacacattgtcctggaagatggaactaagatgaaaggttactcctt 
tggccatccatcctctgttgctggtgaagtggtttttaatactggcctgggagggtaccc 

15 agaagctattactgaccctgcctacaaaggacagattctcacaatggccaaccctattat 
tgggaatggtggagctcctgatactacttctctggatgaactgggacttagcaaatattt 
ggagtctaatggaatcaaggtttcaggtttgctggtgctggattatagtaaagactacaa 
ccactggctggctaccaagagtttagggcaatggctacaggaagaaaaggttcctgcaat 
ttatggagtggacacaagaatgctgactaaaataattcgggataagggtaccatgcttgg 

20 gaagattgaatttgaaggtcagcctgtggattttgtggatccaaataaacagaatttgat 
tgctgaggtttcaaccaaggatgtcaaagtgtacggcaaaggaaaccccacaaaagtggt 
agctgtagactgtgggattaaaaacaatgtaatccgcctgctagtaaagcgaggagctga 
agtgcacttagttccctggaaccatgatttcaccaagatggagtatgatgggattttgat 
cgcgggaggaccggggaacccagctcttgcagaaccactaattcagaatgttcagaagat 

25 tttggagagtgatcgcaaggagccattgtttggaatcagtacaggaaacttaataacagg 
attggctgctggtgccaaaacctacaagatgtccatggccaacagagggcagaatcagcc 
tgttttgaatatcacaaacaaacaggctttcattactgctcagaatcattgctatgcctt 
ggacaacaccctccctgctggctggaaaccactttttgtgaatgtcaacgatcaaacaaa 
tgaggggattatgcatgagagcaaacccttcttcgctgtgcagttccacccagaggtcac 

30 cccggggccaatagacactgagtacctgtttgattcctttttctcactgataaagaaagg 
aaaagctaccaccattacatcagtcttaccgaagccagcactagttgcatctcgggttga 
ggtttccaaagtccttattctaggatcaggaggtctgtccattggtcaggctggagaatt 
tgattactcaggatctcaagctgtaaaagccatgaaggaagaaaatgtcaaaactgttct 
gatgaacccaaacattgcatcagtccagaccaatgaggtgggcttaaagcaagcggatac 

35 tgtctactttcttcccatcacccctcagtttgtcacagaggtcatcaaggcagaacagcc 
agatgggttaattctgggcatgggtggccagacagctctgaactgtggagtagaactatt 
caagagaggtgtgctcaaggaatatggtgtgaaagtcctgggaacttcagttgagtccat 
tatggctacggaagacaggcagctgttttcagataaactaaatgagatcaatgaaaagat 
tgctccaagttttgcagtggaatcgattgaggatgcactgaaggcagcagacaccattgg 

40 ctacccagtgatgatccgttccgcctatgcactgggtgggttaggctcaggcatctgtcc 
caacagagagactttgatggacctcagcacaaaggcctttgctatgaccaaccaaattct 
ggtggagaagtcagtgacaggttggaaagaaatagaatatgaagtggttcgagatgctga 
tgacaattgtgtcactgtctgtaacatggaaaatgttgatgccatgggtgttcacacagg 
tgactcagttgttgtggctcctgcccagacactctccaatgccgagtttcagatgttgag 

45 acgtacttcaatcaatgttgttcgccacttgggcattgtgggtgaatgcaacattcagtt 
tgcccttcatcctacctcaatggaatactgcatcattgaagtgaatgccaagatgtcccc 
gaactctgctctggcctccaaaacgactggctacccattggcattcattgctgcaaagat 
tgccctaggaatcccacttccaggaattaagaacgtcgtatccgggaagacatcagcctg 
ttttgaacctagcctggattacatggtcaccaagattccccgctgggatcttgaccgttt 

50 tcatggaacatctagccgaattggtagctctatgaaaagtgtaggagaggtcatggctat 
tggtcgtacctttgaggagagtttccagaaagctttacggatgtgccacccatctataga 
gggtttcactccccgtctcccaatgaacaaagaatggccatcgaatttagatcttagaaa 
agagttgtctgaaccaagcagcacgcgtatctatgccattgccaaggccattgatgacaa 
catgtcccttgatgagattgagaagctcacatacattgacaagtggtttttgtataagat 

55 gcgtgatattttaaacatggaaaagacactgaaaggcctcaacagtgagtccatgacaga 
agaaaccctgaaaagggcaaaggagattgggttctcagataagcagatttcaaaatgcct 
tgggctcactgaggcccagacaagggagctgaggttaaagaaaaacatccacccttgggt 
taaacagattgatacactggctgcagaatacccatcagtaacaaactatctctatgttac 
ctacaatggtcaggagcatgatgtcaattttgatgaccatggaatgatggtgctaggctg 

60 tggtccatatcacattggcagcagtgtggaatttgattggtgtgctgtctctagtatccg 
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cacactgcgtcaacttggcaagaagacggtggtggtgaattgcaatcctgagactgtgag 
cacagactttgatgagtgtgacaaactgtactttgaagagttgtccttggagagaatcct 
agacatctaccatcaggaggcatgtggtggctgcatcatatcagttggaggccagattcc 
aaacaacctggcagttcctctatacaagaatggtgtcaagatcatgggcacaagccccct 
5 gcagatcgacagggctgaggatcgctccatcttctcagctgtcttggatgagctgaaggt 
ggctcaggcaccttggaaagctgttaatactttgaatgaagcactggaatttgcaaagtc 
tgtggactacccctgcttgttgaggccttcctatgttttgagtgggtctgctatgaatgt 
ggtattctctgaggatgagatgaaaaaattcctagaagaggcgactagagtttctcaggc 
cacgccagtggtgctgacaaaatttgttgaaggggcccgagaagtagaaatggacgctgt 

10 tggcaaagatggaagggttatctctcatgccatctctgaacatgttgaagatgcaggtgt 
ccactcggagaatgccactctgatgctgcccacacaaaccatcagccaaggggccattga 
aaaggtgaaggatgctacccggaagattgcaaaggcttttgccatctctggtccattcaa 
cgtccaatttcttgtcaaaggaaatgatgtcttggtgaatgagtgtaacttgagagcttc 
tcgatccttcccctctgtttccaagactcttggggttgacttcattgatgtggccaccaa 

15 ggtgttgattggagagaatgttgatgagaaacatcttccaacattggaccatcccataat 
tcctgttgactatgttgcaattaaggctcccatgttttcctggccccggttgagggatgc 
tgaccccattctgagatgtgagatggcttccactggagaggtggcttgctttggtgaagg 
tattcatacagccttcctaaaggcaatgctttccacaggatttaagataccccagaaagg 
catcctgataggcatccagcaatcattccggccaagattccttggtgtggctgaacaatt 

20 acacaatgaaggtttcaagctgtttgccacggaagccacatcagactggctcaacgccaa 
caatgtccctgccaacccagtggcatggccgtctcaagaaggacagaatcccagcctctc 
ttccatcagaaaattgattagagatggcagcattgacctagtgattaaccttcccaacaa 
caacactaaatttgtccatgataattatgtgattcggaggacagctgttgatagtggaat 
ccctctcctcactaattttcaggtgaccaaactttttgctgaagctgtgcagaaatctcg 

25 caaggtggactccaagagtcttttccactacaggcagtacagtgctggaaaagcagcata 
gagatgcagacaccccagccccattattaaatcaacctgagccacatgttatataaagga 
actgattcacaactttctcagagatgaatattgataactaaacttcatttcagtttactt 
tgttatgccttaatattctgtgtcttttgcaattaaattgtcagtcacttcttcaaaacc 
ttacagtccttcctaaggttactcttcatgagattcatccatttactaatactgtatttt 

30 tggtggactaggcttgcctatgtgcttatgtgtagctttttactttttatggtgtgatta 
atggtgatcaaggtaggaaaagttgtgttctattttcttgaactccttctatactttaag 
atactctatttttaaaacactatctgcaaactcaggacactttaacagggcagaatactc 
taaaaacttgataaaattaaatatagatttaatttatgaaccttccatcatgtgtttgtg 
tattgcttctttttggatcctcattctcacccatttggctaatccaggaatattgttatc 

35 ccttcccattatattgaagttgagaaatgtgacagagcatttagagtatgaattc 

>xm_114110 (TOXMARKER Assignment: 71; SEQ ID NO: 154) 

gtgccgggaagtggctccagggagaagaggcctcttccctcacccgctgtgggagctgcgccccgaaagcct 
gccccggcacgtcgggctctcctgacccgccaagaccagagagccgttggcgccctccgcccgggcctgccg 

40 gtccgtttattttaagaagctttgtgcgcctgctgtggggatttctgatccaggctgcgaagaatttcgaag 
tctggaaaatagcaactgtgtttgtttctaaaggatcttctcctgacccagcatcgctcatcacaatgaaga 
accaagacaaaaagaacggggctgccaaacaatccaatccaaaaagcagcccaggacaaccggaagcaggac 
ccgagggagcccaggagcggcccagccaggcggctcctgcagtagaagcagaaggtcccggcagcagccagg 
ctcctcggaagccggagggggctcaagccagaacggctcagtctggggcccttcgtgatgtctctgaggagc 

45 tgagccgccaactggaagacatactgagcacatactgtgtggacaataaccaggggggccccggcgaggatg 
gggcacagggtgagccggctgaacccgaagatgcagagaagtcccggacctatgtggcaaggaatggggagc 
ctgaaccaactccagtagtcaatggagagaaggaaccctccaagggggatccaaacacagaagagatccggc 
agagtgacgaggtcggagaccgagaccatcgaaggccacaggagaagaaaaaagccaagggtttggggaagg 
agatcacgttgctgatgcagacattgaatactctgagtaccccagaggagaagctggctgctctgtgcaaga 

50 agtatgctgaactgctggaggagcaccggaattcacagaagcagatgaagctcctacagaaaaagcagagcc 
agctggtgcaagagaaggaccacctgcgcggtgagcacagcaaggccgtcctggcccgcagcaagcttgaga 
gcctatgccgtgagctgcagcggcacaaccgctccctcaaggaagaaggtgtgcagcgggcccgggaggagg 
aggagaagcgcaaggaggtgacctcgcacttccaggtgacactgaatgacattcagctgcagatggaacagc 
acaatgagcgcaactccaagctgcgccaagagaacatggagctggctgagaggctcaagaagctgattgagc 

55 agtatgagctgcgcgaggagcatatcgacaaagtcttcaaacacaaggacctacaacagcagctggtggatg 
ccaagctccagcaggcccaggagatgctaaaggaggcagaagagcggcaccagcgggagaaggattttctcc 
tgaaagaggcagtagagtcccagaggatgtgtgagctgatgaagcagcaagagacccacctgaagcaacagc 
ttgccctatacacagagaagtttgaggagttccagaacacactttccaaaagcagcgaggtattcaccacat 
tcaagcaggagatggaaaagatgactaagaagatcaagaagctggagaaagaaaccaccatgtaccggtccc 

60 ggtgggagagcagcaacaaggccctgcttgagatggctgaggagaaaacagtccgggataaagaactggagg 
gcctgcaggtaaaaatccaacggctggagaagctgtgccgggcactgcagacagagcgcaatgacctgaaca 
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agagggtacaggacctgagtgctggtggccagggctccctcactgacagtggccctgagaggaggccagagg 
ggcctggggctcaagcacccagctcccccagggtcacagaagcgccttgctacccaggagcaccgagcacag 
aagcatcaggccagactgggcctcaagagcccacctccgccagggcctagagagcctggtgttgggtcatgc 
tgggaagggagcggcagcccagccaggcctggcccataaaaggctcccatgctgagcagcccattgctgaag 
ccaggatgttctgacctggctggcatctggcacttgcaattttggattttgtgggtcagttttacgtacata 
gggcattttgcaaggccttgcaaatgcatttatacctgtaagtgtacagtgggcttgcattggggatggggg 
tgtgtacagatgaagtcagtggcttgtctgtgagctgaagagtcttgagaggggctgtcatctgtagctgcc 
atcacagtgagttggcagaagtgacttgagcatttctctgtctgatttgaggctcagacccctccctgccct 
tcagagctcaagacaagtaatacacccaggtcttgactgcatttgtcttgtgagcagggcttgcttggtcag 
ctcaggccctcctagctgctctggaggctcctttgattctctagacctggaaaaggtgtccctaggcagagc 
cctggcagggcgctcagagctggggatttcctgcctggaacaagggacctggagaatgtttttgcgtgggat 
gatgtgctggtcaggagccccttgggcatcgcttcccctgccctttggtagtgccaggaccaggccaatgat 
gcttctcagtagccttatcattcacaggtgcctctctagcctgcacaaatgattgacaagagatcacccaaa 
ggattatttctgaaggtgtttttttctttatttctttttctttttttttttttttctttttctttttttttt 
gcacatgacagtgtttgtattgaggaccttccaaggaagagggatgctgtagcagtggtgcctgggtgcctg 
gcctccagtgtcccacctccttcaccaccccacttggctcctttgccatcttgatgctgaggtttcctgttt 
ggtgagatcaggttgtttgtggtaaaagaaaggaaagggcttctgatggctttgccacaagcttacctgtgg 
gtttcagtcctgagaggccaccaccagttcccatcagcactgtctccatgcagcagttgctgggtcccatgt 
ccagctgcctctttggcttcatgggtttttctgcttcctgcccccacccccacatgtgcaatcctcaagatt 
tgtcctgattctatttcctggcacctccctgcctgtccttggggattctacttcttcctgtgtgggagccca 
tagctgttgtctaacaggtaagaaatgaaattgaactattgactgggccccagaaatccataaaatggctgc 
agacagttgtttctgtgtcctgttctacccccactccagtacataactactatgtactgtgtagagccattc 
tatatgctgaatgttctgctgttgcaaacttgccagggtattagccagtgtttgtgccaagcagttttctgg 
gacaacagaatgactcagaccaagatggataggatggttagggctttgcttcttgctgtttttctttgaagc 
tagttcattgtcctgcaggtcccttcatcttccatacctagcccactcttttagcccttaccttaaatctct 
cagataagttggttcacaaagaatgttaagtactgaatcatgtgtgactgagaccagagatggcaaatgaat 
ggcacaccatttctccttctcctgccccagggcaggtaccactgatctgcatcagagttgcctgctattctc 
tggtgtatccttcacatctaggtgccctcaagcagctgtgtgagtgttgagatctctgccatctctggctga 
gatactgctgtcctgtgaagtgtttcccatgacctttttcttcccctttgaatccctctgtctggagtagtc 
cttgcctcttcctgctccagtagggccttttccctaccccagcccctgtgccaggctaagctggtacaagag 
ctgccaacctcacagagtgtttgctaggcgagagaggtgcagggaagaggcagaggtatgcaccttccccct 
tgaagagaggggaaaggcctacagtggcccacataattgcctgactcacacttcagctacctcttaatgcct 
gtggagggactggagctgctggatcccagtgtggtggtgtaggaggccacagtgagcaggtggccccagctg 
ggtttcccaggtcaggaatgtgggccccaggcaaggtgcagcctttgctcacagctccatccatgtctagac 
cttcaggccagtctgcagatgaggttccctacctttttcttctcttcattgaccaaatcaaccaatcactac 
agctgctctgcttctgctttccaaagtagcccaggtcctgggccagatgcaggggaggtgcctatccatgag 
tgaaggccagtgtcttcctcacctgggtgggtcccacacttgtgacctcagttttaggaccaagatctgtgt 
tggtttcttagattgctagcttttcctccaggggaccacagcaggtgaagctcaagagcgcatggctctgct 
aatagtaaattgttttcagggccttgtccagctgagagcttcatgtccaccagattctgagaggtgtcagca 
gcactttttttttttatttgttgtttgttttccatgaggttatcggaccatgggctgagctcaggcactttc 
tgtaggagactgttatttctgtaaagatggttatttaaccctcctccaccccatcacggtggccctgagggc 
tgacccggaggccagtggagctgcctggtgtccacgggggagggccaaggcctgctgagctgattctccagc 
tgctgccccagcctttccgccttgcacagcacagaggtggtcaccccagggacagccaggcacctgctcctc 
ttgcccttcctgggggaagggagctgccttctgtccctgtaactgctttccttatggcccagcccggccact 
cagacttgtttgaagctgcactggcagcttttttgtctcctttgggtattcacaacagccagggacttgatt 
ttgatgtattttaaaccacattaaataaagagtctgttgcctt 

>ab032968.1 (TOXMARKER Assignment: 72; SEQ ID NO: 164) 

ggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagttagg 

ccgcgagcgactgcggcgccgagccgatgagtaacccgaagcccctagaggagtggtcac 

ctgcctgagggcacttctgtcccaccagcatcagaccaggccgcaccgagtccccggcac 

catgtttgggaagaggaagaagcgggtggagatctccgcgccgtccaacttcgagcaccg 

cgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgccccgccagtggca 

gagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgaccccgcctgcatcac 

ctccatccagcccggggcccccaagggggagcctcatgacgtggcccctaacgggccatc 

agcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccacccgagc 

ccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagctggcccc 

tccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctcaccaca 

gcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggtggtgga 

cccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctccacggg 

catcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaagatgga 
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cctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgagggacta 
ccagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagctctgggt 
ggtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacaccaggatgaa 
cgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgctccacgc 
5 ccagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatgatggcag 
ggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccgaaggaa 
gtcgctggtcggcacgccctactggatggccccagagctcatctcccgccttccctacgg 
gccagaggtagacatctggtcgctggggataatggtgattgagatggtggacggagagcc 
cccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacctgccacc 

10 ccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggaccgcctgct 
ggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccattcctggc 
caaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccagatgagg 
cccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccactgaggcca 
gtagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggcaccaccct 

15 ccttgctgggggtagatgagaccctactactgaactccagttttgatctcgtgactttta 
gaaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctctgggaca 
ggccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcactggaagt 
ctgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatgagtgtgt 
gcacgcgtgtgagtgtgcatgtgtgtgtgtgcaaaggtccagccaccccgtcctccagcc 

20 tgcaaggggtgtctggcgccttgcctgacacccagccccctctccccctgagccattgtg 
ggggtcgatcatgaatgtccgaagagtggccttttcccgtagccctgcgccccctttctg 
tggctggatggggagacaggtcagggccccccaccctctccagcccctgcagcaaatgac 
tactgcacctggacagcctcctcttttctagaagtctatttatattgtcattttataaca 
ctctagcccctgcccttattgggggacagatggtccctgtcctgcggggtggccctggca 

25 gaaccactgcctgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccaccc 
ctgcctcgagttagttttacaattaaaacattgtcttgttttgtgtctgtgtgcgatgtg 
tggggggcagggggccctgcccggctgtcttgggtgggaatttgcagggagagggtctgg 
atctgggagcaaaccacgattccagccaaggcagggcaagggtggggtggggagtgggga 
gttcaggtcatagcagccagtaagctcccccagcctgccactccccagaatggggcagga 

30 ttgtccccacccctggaagcagccagtttgccacagtccatgtgcagactgatcccagtt 
tgccaaatctgcaatttcctggaaccttttaaaggctgtcttgagcgcgtttggtgagta 
ggagctaacccaagttagtaaattgaaggccatttggcaaattggtcagtgggcagatgg 
gcttttggggattgactgaggctgactggcctggagctgctggcttcggagagacaccct 
gtgaagtgtgtccttccacgcaggagcccagagccgagcccacgctggggggaatctgac 

35 tggcatggaggtggccatgccaccatcgctgctgcagctgcatcctggcactttgcgcct 
caggccctgttgggctccactttctgcatcctccccagcccccagggaggcagtggagtg 
gggagagagccaggagtgagcctccgtccccaaagccagccaggcgtcatcagcaccaga 
gacctcagcctggtcctctcgggaagtgagtggccagggcagagattccaggttagtcca 
cgcctcccacccttcacaggtcctgaccccaagaatcagagcactgtgtgtgtggcaggg 

40 cctatgccaagtgcaaacacagcctagatggatcatcacagagtgaaacccagcggtgca 
agcagctgtgctctctgcgatgtattggaggcttaggtgaggtggatgcctttctggaaa 
aaaaaaaaatgctaacattggcaaaagaagaaatagaaaacaagaccaaaataactgtct 
cctcactgcacacacactccagaataaataaaaggtttcaggcttgaatgcactttcaaa 
tgagattttttttttttttttgagacggagtatcgctctgtcgcccaggctggagtgcgg 

45 tggcacggtcttggctcgctgcaacttctgcctcctgggttcaagcgattctcctgcctc 
agcctcccaggtggctgggattgcaggcgcttgccaccacgccgggctagttttttgtat 
tttgggtagagactgggtttcgccatcttggccagactggtcttgaactcctgacctcgt 
gatccgcccacctcagcctcccaaggtgctgggattgcaggcatgagccaccatgcctgg 
cctcaaatgaggtttaccagactttgaaggagcaggtaattccttctaccttgtgaacaa 

50 gtcgttccagaaagatagcagctcaggaggcctctgtgaccatggttccagacccagata 
aggacggcaaagaacagagcatctcagaaacgcaaggctcacagccagggtgcccggcga 
ccccacgggcactgagaacagctagctctaggagctccactctcctgctgaagaaaccac 
gggctcagagacggggagctccctcgcccagccacatctgtgaccccacggtaactctgc 
tggttttggtgccttcagtcactcactgcaggtttgttttgttttgttttgttttgtttt 

55 gagtttttttgttttttgttttttgtttttagagacgggatgttgctatgttgcctaggc 
tggtttcgaactcctggcctcaggggatcctccctccttggccccgcaaggtgctgggat 
tgcaggcgtgagccgccgtgcccggcccgctcactgcagtttgaaggcatggctttgggt 
ggcgtggggtgaaagctgcccgaggccccgttcctccccacgtggctgcctcctgccaga 
gccagtcaggaaaacagaccccaactagagttgtttcaaatggcagggatttggtaccgg 

60 tggttggatcatgacaaagctctgagaaggctggaggagccacagagtgccaagtgccca 
gcaatcattagaggaaggaggctgctgccacctgtgtggctagaggaacagaggggccaa 
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tggcattccccaaaccccactctcgcctctgtctggccagagcagaatggcttcttccag 
cttcccaccctggactcccacccaggagcctcctcctggcagacccttcctgaccccacc 
ggcccggggggtctacagatccatgtttcaggcgtccgcctggagcggaacaggggagtg 
cttaggacaagggtggtgccagaggatccactctgcccacatttagttgaccagctgagg 
5 cactccacgggaatgaatgactctcgacaggtgccggaggtgaggaggggcccggaggcc 
caggaggggcacagggatggattcgtccgcctgggggctggaggtgtgtttacagagccc 
caaaataaacaatgcaaccaggtcagaccagcggttctcacacagtgtggttcccagacc 
agcagcatcactgggagcttactggacacgcaaataaatccctgtgtgccaccccagctg 
catcagatgctggtggggcccagtgatctgtatttaacacaccctccgggggatgccggt 

10 gcccactcacgtttgagaacccctgcgatccacgactgccctcccgtgtaaaaggcccac 
ctctgtgggactccaagtcatcagcaccctagggtccttccgtctttttccttcctcctg 
ggacacctgcctctcccatgtcgtattagagaattccttatgctcccaagtgggcacggg 
gagaggaaggcactcctctttaaggaccgacccagaggttttgccattgcttcactggcc 
agagcttagtcacgcagcctcacccagaggcaagggaggttggaaaatgtagtgtttgtg 

15 tgtgtctaacacaaattctattaccatgcagtcaggattctccactcttgctctttcatt 
agatttgctgggcttcaccctggactttctgatttagtgacagaacagagaacccagagg 
cagacccagatgtgtacaagggcttcatatacaatcaggagatttaataatcatgctagg 
ggccgggtgcagtggctcacgcctgtaatcccaagcactttggggagccgaggcaggcgg 
atcacttgaggtcaggagtttgagaccagcctgggcaacaaagtgagaccctgtctctac 

20 taaaaataacaaaaattagccgggcgtggtggtgggtgcctgtaatcccagctcctcggg 
tggctgaggcatgagaatcacttgaacccaggaggcagaggtttcagtgagctgagatca 
catcactgcactccagcctgggtgacagagtgagattccgtc 

>ml4328.1 (TOXMARKER Assignment: 37; SEQ ID NO : 168) 

25 acggagatctcgccggctttacgttcacctcggtgtctgcagcaccctccgcttcctctc 
ctaggcgacgagacccagtggctagaagttcaccatgtctattctcaagatccatgccag 
ggagatctttgactctcgcgggaatcccactgttgaggttgatctcttcacctcaaaagg 
tctcttcagagctgctgtgcccagtggtgcttcaactggtatctatgaggccctagagct 
ccgggacaatgataagactcgctatatggggaagggtgtctcaaaggctgttgagcacat 

30 caataaaactattgcgcctgccctggttagcaagaaactgaacgtcacagaacaagagaa 
gattgacaaactgatgatcgagatggatggaacagaaaataaatctaagtttggtgcgaa 
cgccattctgggggtgtcccttgccgtctgcaaagctggtgccgttgagaagggggtccc 
cctgtaccgccacatcgctgacttggctggcaactctgaagtcatcctgccagtcccggc 
gttcaatgt cat caatggcggttctcatgctggcaacaagctggccatgcaggagt teat 

35 gatcctcccagtcggtgcagcaaacttcagggaagccatgcgcattggagcagaggttta 
ccacaacctgaagaatgtcatcaaggagaaatatgggaaagatgccaccaatgtggggga 
tgaaggcgggtttgctcccaacatcctggagaataaagaaggcctggagctgctgaagac 
tgctattgggaaagctggctacactgataaggtggtcatcggcatggacgtagcggcctc 
cgagttcttcaggtctgggaagtatgacctggacttcaagtctcccgatgaccccagcag 

40 gtacatctcgcctgaccagctggctgacctgtacaagtccttcatcaaggactacccagt 
ggtgtctatcgaagatccctttgaccaggatgactggggagcttggcagaagttcacagc 
cagtgcaggaatccaggtagtgggggatgatctcacagtgaccaacccaaagaggatcgc 
caaggccgtgaacgagaagtcctgcaactgcctcctgctcaaagtcaaccagattggctc 
cgtgaccgagtctcttcaggcgtgcaagctggcccaggccaatggttggggcgtcatggt 

45 gtctcatcgttcgggggagactgaagataccttcatcgctgacctggttgtggggctgtg 
cactgggcagatcaagactggtgccccttgccgatctgagcgcttggccaagtacaacca 
gctcctcagaattgaagaggagctgggcagcaaggctaagtttgccggcaggaacttcag 
aaaccccttggccaagtaagctgtgggcaggcaagccttcggtcacctgttggctacaca 
gacccctcccctcgtgtcagctcaggcagctcgaggcccccgaccaacacttgcaggggt 

50 ccctgctagttagcgccccaccgccgtggagttcgtaccgcttccttagaacttctaGag 
aagccaagctccctggagccctgttggcagctctagcttttgcagtcgtgtaatgggccc 
aagtcattgtttttctcgcctcactttccaccaagtgtctagagtcatgtgagcctcgtg 
tcatctccggggtggccacaggctagatccccggtggttttgtgctcaaaataaaaagcc 
tcagtgacccatgag 

>xm_035635 (TOXMARKER Assignment: 68; SEQ ID NO: 169) 

99999 a c99tgaaggttgcctcccgcccgtccgggctctgatcctccgtctccccgtcccccggcggccggc 
ccatggcctggcggaggcccgaaccatggacctccgcaccgccgtgtacaacgccgcccgtgatggcaagct 
gcagctgctccagaagctgctcagcggccggagccgggaggaactggacgagctgacgggcgaggtggccgg 
60 cgggggaacgccgctactcatcgccgcccgctacggccacctggacgtggtggagtacctggtggaccggtg 
cggcgcgagcgtggaggccggtggctcggtgcacttcgatggcgagaccatcgagggcgcgccgccgctgtg 
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ggccgcctccgcagccggccacctggacgtggtgcggagcctgctgcgccgcggggcctcggtgaaccgcac 
cacgcgcaccaactccacgcctctccgcgccgcctgcttcgacggccacctggaggtggtgcgctacctggt 
cggcgagcaccaggccgacctggaggtggccaaccggcacggccacacgtgcctcatgatctcgtgctacaa 
gggccaccgtgagatcgcccgctacctgctggagcagggcgcccaggtgaaccggcgcagcgccaagggcaa 
5 cacggccctgcatgactgcgccgagtccggcagcctggagatcctgcagctgctgctggggtgcaaggcccg 
catggaacgtgacggctacggcatgaccccgctgctcgcggccagcgtgacgggccacaccaacatcgtgga 
gtacctcatccaggagcagcccggccaggagcaggtcgcagggggagaggctcagcctgggctgccccaaga 
agacccctccaccagccaggggtgtgcgcagcctcagggggctccgtgctgcagctcctccccagaggaacc 
actgaacggggaatcttacgaaagctgctgtcccaccagccgggaagctgccgtggaagccttggaattgct 

10 gggagctacgtatgtggataagaaacgagatctgcttggggcccttaaacactggaggcgggccatggagct 
gcgtcaccaggggggcgagtacctgcccaaaccggagcccccacagctggtcctggcctatgactattccag 
ggaggtcaacaccaccgaggagctggaggcgctgatcaccgacccggatgagatgcgcatgcaggccctgtt 
gatccgggagcgcatcctcggtccctcgcacccggacacttcctattacatccgttacaggggtgccgtgta 
cgccgactcgggcaatttcgagcgctgcatccgcttgtggaagtacgccctggacatgcaacagagcaacct 

15 ggagcctctgagccccatgaccgccagcagcttcctctccttcgcggaactcttctcctacgtgcttcagga 
ccgggccgccaaaggcagcctgggcacccagatcggctttgcagacctcatgggggttctcaccaaaggggt 
ccgggaagtggaacgggccctgcagctgcccagggagcccggagactcagcccagttcaccaaggcgctggc 
catcatcctccacctgctctacctgctggagaaagtggagtgcacccccagccaggagcacctgaagcacca 
gaccgtctaccgcctgctcaagtgcgcgcccaggggcaagaacggcttcacccctctgcacatggctgtgga 

20 caaggacaccacaaacgtgggccgctatcccgtgggcagattcccctccctgcacgtggtcaaagtgctgct 
cgactgcggggccgacccggacagcagggattttgacaacaacaccccgctacacatagcagcccagaacaa 
ctgcccggccatcatgaatgccctgatcgaagcaggggcccacatggacgccaccaatgccttcaagaagac 
ggcctacgagctgctggacgagaagctgctggccaggggtaccatgcagcccttcaactacgtgaccctgca 
gtgccttgcggcccgggccctggataagaacaagatcccttacaagggcttcatcccggaagatctggaggc 

25 gttcatcgaactgcactgacctgcccagaacgcctgcaccctcacctctcccctctcctgctgagatggggg 
aaatccggctgcggcatagcagatgctcgttcttgcctccttcaggcaccaatcaggagaagggttctgcct 
cccatcccctctacctgcagacagggtcggaggtgttagcgagcctttggtgctagaagcctgcggggtcat 
gtgctaagaggacagtctttctccgggagcccgctcactcattctgagttaggaaaagacacaagaccttcc 
ccacatcctgtctgcctgggttagggaggcctttgccttgttacctagaggcggagggactgaagccattgc 

30 gttccttccctgctagaaacacaggaagaagttgaggactgtctgccttccctcgtccctttacctggccag 
ataactccagccgctgaatacagtgttaggactgggggctcctgagatgagagtttgagattcagggaatga 
gaccacctctcatttcttccagcatgatcgcgccctgctcccgtgccaccgtagtccctggcagacaggcag 
ggctctgcccagggcagcctgccacttgcatagctttcggttggtttggtgttctgtttatttaataagtgg 
gcaggttgcaagcgttgcacagaaattctgagattttactgcctttttttttttttttaagaaagttgtttg 

35 ttggactccataagtgaatttcaagcagtgaggattttgtggtgcctgagatggccgagggcacagggagtg 
agctgtatgtgtgaggaatttggtgagcgagataaaagtccacggtgtcaacccctaaaacatgggtgaccg 
tacatttttatacatctccactctacggccttttacaggctttccgattttacaggcctttccaagtttcca 
ttctccttagagagagaactgtgcttccaaacagaaatcaggagtgaccacaaagcctgaaaacactttgcc 
acccagcaaagaactggcacaattggtttgggtctgcattgccatagtgcccgagttaaaactgcaggccac 

40 tctgccttgcagtggcctctgatttcattgtgggtgcatccacaggtggcccgagctgttctttcagctgct 
ccaaggattgagacccaagtcatcatgaaaaaggcccaagtacagtcttaatgcgataaatccactagctaa 
gacgtcgagtgccaagaccagccttccagccgaggtttggacaaagtctcaggttcccgtgactcagggtaa 
ggtgctggggctgccagaggacctgccccagcaagatttttgtcaagagcgagactccatcagcccaggcag 
acgggagcaggttcttggccagcgtagacagcagcaaacagcagcagggaagccattctcactgcatcctcc 

45 ctgcagtagccacggccaggcccttaggaggagcagtgaccgggggtgtccagaaatatcctgtccctggat 
ggaaactaggtctcgtttggatttttttttttttttgccgtgttaggaaattatttattaatttacaagaca 
ggttttaactcagccgaggtgggaaatggtgtccctgtccctcccaaagcacagagcacagaaatgaggccg 
tttacatggcgagtctccgtgctggtgtttaagtcattaaaaagatactcaaagag 

50 >bc020744 (TOXMARKER Assignment: 55; SEQ ID NO: 170) 

acaggatctgcttagtgaaagaagtggcaagcaatggatcccaaatatcagcgtgtagag 
ctaaatgatggtcacttcatgcccgtattgggatttggcacctatgcacctccagaggtt 
ccgaggaacagagctgtagaggtcaccaaattagcaatagaagctggcttccgccatatt 
gattctgcttatttatacaataatgaggagcaggttggactggccatccgaagcaagatt 

55 gcagatggcagtgtgaagagagaagacatattctacacttcaaagctttggtgcactttc 
tttcaaccacagatggtccaaccagccttggaaagctcactgaaaaaacttcaactggac 
tatgttgacctctatcttcttcatttcccaatggctctcaagccaggtgagacgccacta 
ccaaaagatgaaaatggaaaagtaatattcgacacagtggatctctctgccacatgggag 
gtcatggagaagtgtaaggatgcaggattggccaagtccatcggggtgtcaaacttcaac 

60 tacaggcagctggagatgatcctcaacaagccaggactcaagtacaagcctgtctgcaac 
caggtagaatgtcatccttacctcaaccagagcaaactgctggatttctgcaagtcaaaa 
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gacattgttctggttgcccacagtgctctgggaacccaacgacataaactatgggtggac 
ccaaactccccagttcttttggaggacccagttctttgtgccttagcaaagaaacacaaa 
cgaaccccagccctgattgccctgcgctaccagctgcagcgtggggttgtggtcctggcc 
aagagctacaatgagcagcggatcagagagaacatccaggtttttgaattccagttgaca 
5 tcagaggatatgaaagttctagatggtctaaacagaaattatcgatatgttgtcatggat 
tttcttatggaccatcctgattatccattttcagatgaatattagcatagagggtgttgc 
acgacatctagcagaaggccctgtgtgtggatggtgatgcagaggatgtctctatgctgg 
tgactggacacacggcctctggttaaatccctcccctcctgcttggcaacttcagctagc 
tagatatatccatggtccagaaagcaaacataataaatttttatcttgaactaaaaaaaa 
10 aaaaaaaaaaaaaaaaaaa 

>x52425.1 (TOXMARKER Assignment: 43; SEQ ID NO: 133) 

ggcgaatggagcaggggcgcgcagataattaaagatttacacacagctggaagaaatcat 

agagaagccgggcgtggtggctcatgcctataatcccagcacttttggaggctgaggcgg 

15 gcagatcacttgagatcaggagttcgagaccagcctggtgccttggcatctcccaatggg 
gtggctttgctctgggctcctgttccctgtgagctgcctggtcctgctgcaggtggcaag 
ctctgggaacatgaaggtcttgcaggagcccacctgcgtctccgactacatgagcatctc 
tacttgcgagtggaagatgaatggtcccaccaattgcagcaccgagctccgcctgttgta 
ccagctggtttttctgctctccgaagcccacacgtgtatccctgagaacaacggaggcgc 

20 ggggtgcgtgtgccacctgctcatggatgacgtggtcagtgcggataactatacactgga 
cctgtgggctgggcagcagctgctgtggaagggctccttcaagcccagcgagcatgtgaa 
acccagggccccaggaaacctgacagttcacaccaatgtctccgacactctgctgctgac 
ctggagcaacccgtatccccctgacaattacctgtataatcatctcacctatgcagtcaa 
catttggagtgaaaacgacccggcagatttcagaatctataacgtgacctacctagaacc 

25 ctccctccgcatcgcagccagcaccctgaagtctgggatttcctacagggcacgggtgag 
ggcctgggctcagtgctataacaccacctggagtgagtggagccccagcaccaagtggca 
caactcctacagggagcccttcgagcagcacctcctgctgggcgtcagcgtttcctgcat 
tgtcatcctggccgtctgcctgttgtgctatgtcagcatcaccaagattaagaaagaatg 
gtgggatcagattcccaacccagcccgcagccgcctcgtggctataataatccaggatgc 

30 tcaggggtcacagtgggagaagcggtcccgaggccaggaaccagccaagtgcccacactg 
gaagaattgtcttaccaagctcttgccctgttttctggagcacaacatgaaaagggatga 
agatcctcacaaggctgccaaagagatgcctttccagggctctggaaaatcagcatggtg 
cccagtggagatcagcaagacagtcctctggccagagagcatcagcgtggtgcgatgtgt 
ggagttgtttgaggccccggtggagtgtgaggaggaggaggaggtagaggaagaaaaagg 

35 gagcttctgtgcatcgcctgagagcagcagggatgacttccaggagggaagggagggcat 
tgtggcccggctaacagagagcctgttcctggacctgctcggagaggagaatgggggctt 
ttgccagcaggacatgggggagtcatgccttcttccaccttcgggaagtacgagtgctca 
catgccctgggatgagttcccaagtgcagggcccaaggaggcacctccctggggcaagga 
gcagcctctccacctggagccaagtcctcctgccagcccgacccagagtccagacaacct 

40 gacttgcacagagacgcccctcgtcatcgcaggcaaccctgcttaccgcagcttcagcaa 
ctccctgagccagtcaccgtgtcccagagagctgggtccagacccactgctggccagaca 
cctggaggaagtagaacccgagatgccctgtgtcccccagctctctgagccaaccactgt 
gccccaacctgagccagaaacctgggagcagatcctccgccgaaatgtcctccagcatgg 
ggcagctgcagcccccgtctcggcccccaccagtggctatcaggagtttgtacatgcggt 

45 ggagcagggtggcacccaggccagtgcggtggtgggcttgggtcccccaggagaggctgg 
ttacaaggccttctcaagcctgcttgccagcagtgctgtgtccccagagaaatgtgggtt 
tggggctagcagtggggaagaggggtataagcctttccaagacctcattcctggctgccc 
tggggaccctgccccagtccctgtccccttgttcacctttggactggacagggagccacc 
tcgcagtccgcagagctcacatctcccaagcagctccccagagcacctgggtctggagcc 

50 gggggaaaaggtagaggacatgccaaagcccccacttccccaggagcaggccacagaccc 
ccttgtggacagcctgggcagtggcattgtctactcagcccttacctgccacctgtgcgg 
ccacctgaaacagtgtcatggccaggaggatggtggccagacccctgtcatggccagtcc 
ttgctgtggctgctgctgtggagacaggtcctcgccccctacaacccccctgagggcccc 
agacccctctccaggtggggttccactggaggccagtctgtgtccggcctccctggcacc 

55 ctcgggcatctcagagaagagtaaatcctcatcatccttccatcctgcccctggcaatgc 
tcagagctcaagccagacccccaaaatcgtgaactttgtctccgtgggacccacatacat 
gagggtctcttaggtgcatgtcctcttgttgctgagtctgcagatgaggactagggctta 
tccatgcctgggaaatgccacctcctggaaggcagccaggctggcagatttccaaaagac 
ttgaagaaccatggtatgaaggtgattggccccactgacgttggcctaacactgggctgc 

60 agagactggaccccgcccagcattgggctgggctcgccacatcccatgagagtagagggc 
actgggtcgccgtgccccacggcaggcccctgcaggaaaactgaggcccttgggcacctc 
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gacttgtgaacgagttgttggctgctccctccacagcttctgcagcagactgtccctgtt 
gtaactgcccaaggcatgttttgcccaccagatcatggcccacgtggaggcccacctgcc 
tctgtctcactgaactagaagccgagcctagaaactaacacagccatcaagggaatgact 
tgggcggccttgggaaatcgatgagaaattgaacttcagggagggtggtcattgcctaga 
5 ggtgctcattcatttaacagagcttccttaggttgatgctggaggcagaatcccggctgt 
caaggggtgttcagttaaggggagcaacagaggacatgaaaaattgctatgactaaagca 
gggacaatttgctgccaaacacccatgcccagctgtatggctgggggctcctcgtatgca 
tggaacccccagaataaatatgctcagccaccctgtgggccgggcaatccagacagcagg 
cataaggcaccagttaccctgcatgttggcccagacctcaggtgctagggaaggcgggaa 
10 ccttgggttgagtaatgctcgtctgtgtgttttagtttcatcacctgttatctgtgtttg 
ctgaggagagtggaacagaaggggtggagttttgtataaataaagtttctttgtctc 

>j 02 906 (TOXMARKER Assignment: 48; SEQ ID NO: 140) 
gcaggctcagcgcatcccagccagtgtctcctgcagctcagcagctgccttcaccatgga 

15 cagcataagcacagccatcttactcctgctcctggctctcgtctgtctgctcctgaccct 
aagctcaagagataagggaaagctgcctccgggacccagacccctctcaatcctgggaaa 
cctgctgctgctttgctcccaagacatgctgacttctctcactaagctgagcaaggagta 
tggctccatgtacacagtgcacctgggacccaggcgggtggtggtcctcagcgggtacca 
agctgtgaaggaggccctggtggaccagggagaggagtttagtggccgcggtgactaccc 

20 tgcctttttcaactttaccaagggcaatggcatcgccttctccagtggggatcgatggaa 
ggtcctgagacagttctctatccagattctacggaatttcgggatggggaagagaagcat 
tgaggagcgaatcctagaggagggcagcttcctgctggcggacgtgcggaaaactgaagg 
cgagccctttgaccccacgtttgtgctgagtcgctcagtgtccaacattatctgttccgt 
gctcttcggcagccgcttcgactatgatgatgagcgtctgctcaccattatccgccttat 

25 caatgacaacttccaaatcatgagcagcccctggggcgagttgtacgacatcctagaccc 
cagattcccgagcctcctggactgggtgcctgggccgcaccaacgcatcttccagaactt 
caagtgcctgagagacctcatcgcccacagcgtccacgaccaccaggcctcgtctccccg 
ggacttcatccagtgcttcctcaccaagatggcagaggagaaggaggacccactgagcca 
cttccacatggataccctgctgatgaccacacataacctgctctttggcggcaccaagac 

30 ggtgagcaccacgctgcaccacgccttcctggcactcatgaagtacccaaaagttcaagc 
ccgcgtgcaggaggagatcgacctcgtggtgggacgcgcgcggctgccggcgctgaagga 
ccgcgcggccatgccttacacagacgcggtgatccacgaggtgcagcgctttgcagacat 
catccccatgaacttgccgcaccgcgtcactagggacacggcctttcgcggcttcctgat 
acccaagggcaccgatgtcatcaccctccttaacaccgtccactacgaccccagccagtt 

35 cctgacgccccaggagttcaaccccgagcattttttggatgccaatcagtccttcaagaa 
gagtccagccttcatgcccttctcagctgggcgccgtctgtgcctgggagagctgctggc 
gcgcatggagctctttctgtacctcaccgccatcctgcagagcttttcgctgcagccgct 
gggtgcgcccgaggacatcgacctgaccccactcagctcaggtcttggcaatttgccgcg 
gcctttccagctgtgcctgcgcccgcgctaacgccccggcccttccagattcgcctgtga 

40 gcgatgaggcccacccatgtgggttgctacgtccccttcttggtccacagtctgccctca 
tccctctggcagtcacgctgtcttccctgcatgctgtgcctgccgcgtgcccttccccca 
tccctccaatctgtgccccgtctgcagggcagaggcagatgtggcatgtctttttgtacc 
cacagagcttgttctatggcacgcccttttctaggctttttgtatcatttcttagtacat 
tgtaatagattcaaaccagtcttgg 

>bc001517 (TOXMARKER Assignment: 59; SEQ ID NO: 145) 
ggcacgaggcaggcttcatttggagtcaggcctggctgttgctcaggtgaccagcttgtg 
tctctgggagggcgctgctttccccggccacccggcgcgatgatccagaatgtcggaaat 
cacctgcgacggctctctttggaaggagaaattgcagaacccctgggctggtttattatc 

50 agcagcaatctccaaacaagccccaggggctgtcagatcccccacgggctggctgtgtag 
• agtggaatcaccttcagcaagtgtcggcctctggaattctattcgggcttggcctctgtg 
ttctccaaccacacatcccggaagtcagccttacgtgcggggaacgacagtgccatggca 
gacggcgagggataccggaaccccacggaggtgcagatgagccagctggtgctgccctgc 
cacaccaaccaacgtggtgagctgagcgtcgggcagctgctcaagtggattgacaccacg 

55 gcttgcctgtccgcggagaggcacgctggctgcccctgtgtcacagcttccatggatgac 
atctattttgagcacaccattagtgttggacaagtggtgaatatcaaggccaaggtgaac 
cgggccttcaactccagcatggaggtgggcatccaggtggcctcggaggacctgtgctct 
gagaagcagtggaatgtgtgcaaggccttggccaccttcgtggcccgccgagagatcacc 
aaggtgaagctgaagcagatcacgccgcggacagaagaggagaagatggagcacagtgtg 

60 

gcggctgagcgccggcgcatgcgccttgtctatgcagacaccatcaaggacctcctggcc 
aactgcgccattcagggcgatctggagagcagagactgtagccgcatggtgccggctgag 
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aagacccgtgtggagagtgtggagctggtcctgcctccccacgccaatcaccagggcaac 
acctttgggggccagatcatggcctggatggagaatgtggccaccattgcagccaggtga 
gggcagggtgtgctgcctctgcctcccctcctttctcctcctcctccccttggctacctc 
cctctggaggggaaaccccagcttggggttggcattcaaggcttcagaagcttggctgtt 
5 ctgaatcagagaaatgaatttttgtgaactgaccattccttgttctactaaaaaagctag 
catcttttacatgggaaacaccaggtctcttggcctggcactagatcctccccttgatct 
ggccctacctgcactccttctagtatctatgttcccttcacatcaagccttctagtatct 
atgttcgcttcacatcaaaccatttgctgttctctgttcccatcctccactttcccagcc 
cctgcctttgctcctgatgtagcctcctgccgtgcttcccctactcttctttgtctgcta 

10 atatcctgcccacttcctccataaagccatctctgactgttcccttcttctaaggggtga 
aaattgttttctctcctctaacatctgtttctgtccggggcttgttctaccctaaatatc 
agggtattttttatagttatggtaactgaccttcactaattgacactctcacacctccaa 
gactttgctcttgctgttccctctaccaggagtgcctttcccaacccatgccctttccag 
ccaggtggattcctccttattctttagagcctggcttaaatggcccctcctccagtttaa 

15 cctgtgggagacagtgcataagcaatgctgttttgggcaggcctggctatgagtgcagta 
agatcctggaggagcctgatggtcagggaaggctgcctgaaggaagagcacttcagctgg 
gacttgaatgccaagtagctttgggtaaggggagggcttctggatagtgggaacagcagc 
taccaaggtgtaaaagttggaaggaaaatgggaaaggggtttacccaaagccctgctttc 
ttctgtcccctcaaacttggcttctttccagccatgcatagacctcagtattctaaacta 

20 tgaaatgggactttagttctgtgcctctgggcagaactgccactgggttgggtggcagtg 
ggtgggtcagaatgtgtagttccaggctgcgtctggggatgggaccagggtagaaggccg 
gcccaagctggcctagcatggtggctcacacctgtaatcccagcgctttgggaggctgag 
gccacttaggccagaagttcaagaccagcctgggaacaaggtaaaacctcatctctacta 
aaaatacaaaaattagccaggtgtggtggtgcgtgcctgtagtctcagctacttgggagg 

25 ctgaggcaggagagtcacttgaacccgggaggcggaggttgcagtgagccgagattgcac 
cactgcattccagcctgggcaacagagtgaaaccctgtctcaaaaaaaaaaaaaaaaaaa 
aaaa 

>bc014075 (TOXMARKER Assignment: 57; SEQ ID NO: 171) 

30 ggagcgcagtcgctccgcgatggactcgccggtcccggcctctatgttcgcccccgagcc 
cagctccccgggggcggccagggccgcggcggccgccgcccgactccacggcggctttga 
ctcggactgcagcgaggacggcgaggcgctcaacggcgagccagagctggacctcaccag 
caagctggttctagtgagccctacatcagagcagtatgacagcctacttcggcagatgtg 
ggagaggatggacgagggatgcggagagaccatatatgtcattgggcagggatcagatgg 

35 gactgagtatgggctgagtgaagctgacatggaggcctcctacgccacagtgaagagcat 
ggcggaacagatagaggccgatgtcatccttctgcgggaacggcaagaagctgggggccg 
cgtgcgtgattacctggtccggaaacgagtaggagacaatgacttcctggaggtcagggt 
agcagtggtgggcaacgtggatgctggcaaaagcacgcttctgggggtcctgacacatgg 
ggagctggacaatggccgaggctttgcccgccagaaactcttccgccacaaacatgaaat 

40 tgaatctggtcgcaccagcagtgtgggcaacgacattctgggctttgacagtgaaggcaa 
tgtagtgaacaagcctgacagccacggcggcagcctggagtggaccaagatctgtgagaa 
gtccacgaaagtcattaccttcatcgacttggctggtcatgagaagtacctgaaaaccac 
tgtcttcggcatgacaggccatctgcctgacttctgcatgctcatggtgggcagcaatgc 
tggcatcgtggggatgaccaaagaacacctgggcttggcactggcactcaatgtacctgt 

45 ctttgtggtagtcaccaagattgacatgtgtcctgccaacatcctgcaagaaaccctgaa 
gctgttacagcgcctgctgaagtcaccaggctgccggaagatccccgtgctggtgcagag 
caaagatgatgtgattgtcacagcctccaacttcagctctgaaaggatgtgcccgatatt 
ccagatctccaacgttacaggcgagaacctagatctgctgaagatgttcctcaacctcct 
ctccccccgcaccagctacagggaggaggagcctgctgagtttcagattgatgacaccta 

50 ctccgtcccgggtgtggggacagtggtttcggggacaacactgagaggcctgatcaagct 
gaatgacacgctgctgctgggcccagaccccttgggtaacttcctgtccattgctgtcaa 
atccatccatcgcaagcgcatgcctgtcaaggaggtgcggggtggccagacagcatcctt 
tgcgctgaagaagatcaagcgctcgtccatccggaagggcatggtgatggtttccccacg 
tttgaatccccaagcctcctgggagtttgaggccgagattctcgtcctccaccaccccac 

55 cacaattagcccgcgctaccaggccatggtgcactgtgggagcatcaggcagacagccac 
cattctgagcatggacaaggactgtctgcgcactggggacaaggccactgtacacttccg 
cttcatcaagacccctgagtacctgcacatagaccagcggctggtgttccgggaaggccg 
caccaaggctgtcggcaccatcaccaagctcctccagaccaccaacaactccccaatgaa 
ctccaagccgcagcagattaaaatgcagtcgacgaaaaagggccccctgacgaaacgaga 

60 cgaggggggcccgtctggtgggccagcagtaggagcacccccacctggagatgaagcctc 
ctctgtaggggcagggcaaccagctgcgtccagcaatctccagcctcagcctaagcccag 
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cagtggaggccggcgacgagggggccagcgccacaaggtgaagtcccagggggcctgtgt 
gactcctgccagcggctgctgaaccttcccctggcccaccctcaccacccaaggggtcat 
catctctggccaccactccaccagatgggcagagcagctatgaccgccacccagccctcc 
cgctcaggccacagccggagcctccgcattgcccccacccccattttccaggggggttgt 
5 aatttataagctgacgaaggtagccagacttccggaggactgaccatctctcactgtcct 
ccccaccttcttcctcactcacacattttttgtacatctgggcccttagtttttattctg 
tttattatatgtctctgtctctctctattgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtg 
tgtgtgtgtggtgcaggagtgccacccccagggccctgtcaacctctcttttctcctcca 
tggctgtctgcctgcgtatctgtctctgagaatcctcggggcggtcaggggatgtcagga 

10 ggggaaggagccgccctccctatcttgctgctcctcttggcactcaggggcaccttccat 
ggagccagaccgggtggaggggcttctgggatttggtgtctgctgctgccagagcaggaa 
cccccagtctaggacttgggcattttaacagggagaaagtagtggcttcccttttctctc 
tctcctcctttttccctttaagcccacagattcaggtcatgccaaaagctctctggttgt 
aacctggagacatgtggaggggaatggcgatgggattataggactctccccatctcgggc 

15 cctgaccctgacccttgccaccaacccaaagacagctggtgggtttccccttggagacaa 
tcctgcgtttgcctgggccggccctggctgccctcagctttcgctgatctgcccggcctg 
gagcctcccatcaccccgcttcttgttgggcctcaggcactggttaccagaagggggtct 
gggtctgctcaggatcatgttttgtagcacctcctgttggaggggtggagggatgttccc 
ctgagccaggctgagactagaaccccatcttccctgagccaggctgagactagaacccca 

20 tcttccccaccacgccacccctgtggctgctacaggagcacagtagtgaaggcctgagct 
ccaggtttgaaagacccaactggagcgtggggcgggcaggcaggggttagtgaaaggaca 
cttccagggttaggacagagcatttagccttctggaagaacccctgcctggggtgggact 
gtgcaggccagagaaggtggcatgggcctgaacccacctggactgacttctgcactgaag 
ccacagatggagggtaggctggtgggtgggggtggttcgttctctagccggggcagacac 

25 ccagctggctgggtccttcctcagccttgcctcctcctgtccccaaccctttcctttcct 
cctgcttgcggactgctggtcccctctccttccctccttccagctgtttctagttaccac 
ctacccctggccgtggactgatcagaccagcattcaaaataaaagtttgttccaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaa 

30 >ak027871 (TOXMARKER Assignment: 122; SEQ ID NO: 134) 

gtttgacgatgagagtgatggggaagaagaggaggagctcatggatgaggatgtggaaga 
agaggatgactcagagatctcagggtacagcgtggagaatgccttcttcgatgagaagga 
agacacctgtgctgccgtgggggagatctctgtgaacaccagtgtggccttccttccata 
catggaaagtgtctttgaagaagtatttaaactgctggagtgccctcacctgaatgtgcg 

35 gaaggcagcccatgaggctctgggtcagttttgctgtgcactgcacaaggcctgtcaaag 
ctgcccctcggaacccaacactgctgctttgcaggctgccctggcccgagtcgtgccatc 
ctacatgcaggcagtgaacagggagcgggaacgccaggtggtgatggccgtgctggaggc 
cctgacaggggtgctccgcagctgtgggaccctcacactgaagccccctgggcgcctcgc 
tgagctctgtggcgtgctcaaggctgtgctgcagaggaagacagcctgtcaggatactga 

40 cgaggaggaggaagaggaagatgatgatcaggctgaatacgacgccatgttgctggagca 
cgctggagaggccatccctgccctggcagccgcggctgggggagactcctttgccccatt 
ctttgccggtttcctgccattattggtgtgcaagacaaaacagggctgcacagtggcaga 
gaagtcctttgcagtggggaccttggcagagactattcagggcctgggtgctgcctcagc 
ccagtttgtgtctcggctgctccctgtgctgttgagcaccgcccaagaggcagaccccga 

45 ggtgcgaagcaatgccatcttcgggatgggcgtgctggcagagcatgggggccaccctgc 
ccaggaacacttccccaagctgctggagctcctttttcccctcctggcgcgggagcgaca 
tgatcgtgtccgtgacaacatctgtggggcacttgcccgcctgttgatggccagtcccac 
caggaaaccagagccccaggtgctggctgccctactgcatgccctgccactgaaggagga 
cttggaggagtgggtcaccattgggcgcctcttcagcttcctgtaccagagcagccctga 

50 ccaggttatagatgtggctcccgagcttctgcgtatctgcagcctcattctggctgacaa 
caagatcccaccagacaccaaggccgcactgttgctgctcctgacgttcctggccaaaca 
gcacaccgacagctttcaagcagctctgggctcactgcctgttgacaaggctcaggagct 
ccaggctgtactgggcctctcctagactgcaggctgcagccagtccagagagaatagagc 
ctgcccaggccttaagaccacctctcagcccagttcagttctgccttaccaaagattctg 

55 agactcatacccatttggagccagccccacttgctgccttacagggctgtccctgaggct 
ggatctgttacaaatgagtcatgacatcatactgtaataaaagcagcttgttttctgctt 
gaacaatag 

>bc003394 (TOXMARKER Assignment: 63; SEQ ID NO: 149) 
60 cgaagactgagcggttgtggccgcgttgccgacctccagcagcagtcggcttctctacgc 
agaacccgggagtaggagactcagaatcgaatctcttctccctccccttcttgtgagatt 
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tttttgatcttcagctacattttcggctttgtgagaaaccttaccatcaaacacgatggc 
cagcaacgttaccaacaagacagatcctcgctccatgaactcccgtgtattcattgggaa 
tctcaacactcttgtggtcaagaaatctgatgtggaggcaatcttttcgaagtatggcaa 
aattgtgggctgctctgttcataagggctttgccttcgttcagtatgttaatgagagaaa 
tgcccgggctgctgtagcaggagaggatggcagaatgattgctggccaggttttagatat 
taacctggctgcagagccaaaagtgaaccgaggaaaagcaggtgtgaaacgatctgcagc 
ggagatgtacggctcctcttttgacttggactatgactttcaacgggactattatgatag 
gatgtacagttacccagcacgtgtacctcctcctcctcctattgctcgggctgtagtgcc 
ctcgaaacgtcagcgtgtatcaggaaacacttcacgaaggggcaaaagtggcttcaattc 
taagagtggacagcggggatcttccaagtctggaaagttgaaaggagatgaccttcaggc 
cattaagaaggagctgacccagataaaacaaaaagtggattctctcctggaaaacctgga 
aaaaattgaaaaggaacagagcaaacaagcagtagagatgaagaatgataagtcagaaga 
ggagcagagcagcagctccgtgaagaaagatgagactaatgtgaagatggagtctgaggg 
gggtgcagatgactctgctgaggagggggacctactggatgatgatgataatgaagatcg 
gggggatgaccagctggagttgatcaaggatgatgaaaaagaggctgaggaaggagagga 
tgacagagacagcgccaatggcgaggatgactcttaagcacatagtggggtttagaaatc 
ttatcccattatttctttacctaggcgcttgtctaagatcaaatttttcaccagatcctc 
tcccctagtatcttcagcacatgctcactgttctccccatccttgtccttcccatgttca 
ttaattcatattgccccgcgcctagtcccattttcacttcctttgacgctcctagtagtt 
ttgttaagtcttaccctgtaatttttgcttttaattttgatacctctttatgacttaaca 
ataaaaaggatgtatggtttttatcaactgtctccaaaataatctcttgttatgcaggga 
gtacagttcttttcattcatacataagttcagtagttgcttccctaactgcaaaggcaat 
ctcatttagttgagtagctcttgaaagcagctttgagttagaagtatgtgtgttacaccc 
tcacattagtgtgctgtgtggggcagttcaacacaaatgtaacaatgtatttttgtgaat 
gagagttggcatgtcaaatgcatcctctagaaaaataattagtgttatagtcttaagatt 
tgttttctaaagttgatactgtgggttatttttgtgaacagcctgatgtttgggaccttt 
tttcctcaaaataaacaagtccttattaaaccaggaatttggagaaaaaaaaaaaaaaaa 

>bc006191 (TOXMARKER Assignment: 79; SEQ ID NO: 156) 

ggcacgaggctttcaccccccccccccggccattaccgaagcggatgaaaacaaacacta 

acgatggcggcgccgggaagcgaccggctgctgggcttaaggcgggagtgaccgcttaac 

cagtgagggaagcactgaagagcgccagtcgacgtgggtgcgacaactcgcggagtctta 

ggagcaaaacgtctggggcctgcgagccaggacccttctgaagccttaggtgtctatcgg 

cgacgtgtacggtcactgcagctccggagcgcggaaccctcagccaggaggcgcggctgg 

tcggtcccaggtcccggcctccgtaatgagagcccggaaccactctttgtgccgcagctt 

cgcagcatcttggactcaagtgattctcctgcctcagcctcctgagtagctgggactaca 

gattcctataggcaatggaaactgatctcaattcccaggacagaaaggacctggacaagt 

ttattaaattttttgccctcaagactgtccaagtgattgtccaggctcggcttggtgaaa 

agatttgcactcgttcatcatcttctccaacgggttcagattggttcaacttagcaatca 

aagacatcccagaggttacacatgaagcaaagaaggcactggcaggacagctgcctgcag 

tcgggaggtccatgtgtgtggagatttcacttaagacttctgagggagattccatggagc 

tggaaatatggtgtcttgaaatgaatgaaaagtgtgataaagaaatcaaagtttcctaca 

cggtgtacaacagactgtcattgctgctgaagtcccttcttgctataactagggtgacac 

cagcctataggctctccaggaaacaagggcatgaatatgtcatattatacaggatatatt 

ttggagaagttcagctgagtggcttaggagaaggcttccagacagttcgtgttgggacag 

tgggcacccctgtgggcaccatcactctttcttgtgcttacagaattaacttggcattca 

tgtctaccaggcaatttgagaggaccccacctatcatggggattattattgatcactttg 

tggaccgtccctatcccagctcctctcccatgcacccctgcaattacagaactgctggtg 

a gg a cactggagtaatatacccgtctgtagaagactctcaagaagtgtgtaccacctctt 

tttccacctccccaccatcccagctgatggttcctgggaaggaaggtggggtaccccttg 

ctcccaaccagcctgtccatggtacccaggctgaccaggagagactggcaacctgcaccc 

cttctgacagaacccactgtgctgccacaccctccagtagtgaggatactgaaaccgtat 

caaacagcagtgagggacgggcctcccctcacgatgtcttggagaccatctttgtccgaa 

aagtgggggcttttgtcaacaaacccattaaccaggtgaccctgacgagtttggatatac 

cctttgccatgtttgctcccaagaatttggagctggaggataccgatccaatggtgaatc 

ctccagattccccagagactgaatctcctctccagggcagcctgccttgcagctggcccc 

ttccctgcctgctgtcaccatccactgtttgacattccagctggtggccaagagattggt 

gtggaggcagaaagaggaaggagacagtgccaggaggaagaaggaaggagtcccttagct 

ctcttcattgtcccctttacttcctgctatcttcttctcctcttcttctctctcttgcct 

ctatgcctgtatttctggcaatatgacaggcctgcctacccaagatcagaactccaaaac 

cactcccacccctgaaggtcgggagggtctgagcagccctggtggctgcctgtgctcagg 
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tcctcagctccatgggaaataaaaatggcaccctgaatctctaggattttgtcacttgga 
gtcacagcaaagttctcttcctcttgtccccccgttgctgctccttggttatagaacatg 
gtaaatatttattactttcagagaaaccagatattttatagaggaaatatgtttgaggtg 
agttgtttttcacttggagaaggcggagggctcttcctgggacggagacctcctcctccg 
5 gaggttattgagaatccgggctgctgctttgaggatcttcccaccatacagacagcgaga 
tccaagaagagggctggccgggggcaaagtcacctcccagtgtggctgcactggaactga 
ctaaaggctttaccttggatagttgcgtattcctggtgagagccttacatctcccacagt 
ttctgcagagtgactgactccattctggcagcccaggaagtcctgggtgctaaatgtgat 
ggccacatgtagtggttaggggatgttgtgtgtgtcccccaactgcctgggtacttgttc 

10 ctgatccctggggctgtcctgtggagcttttcctcctgcttgggcctagctaccatctcc 
ctctaatcccaggttctctacactgccctggggtttaccagctggattggcttctggttg 
agaaatcaaagctgggcgtatgattgacttaacccttcaggtattgttacttgaataagt 
caagtgcctagcctcacccacctatgatctgtcctttcccagcctcgctggtagtcctgg 
tcaaggagatctaggtctactccattcctcctggcccacctggggcattcactggcagca 

15 gctgtgcttcagtggagcaggtggttctcagctgcttgttagtatactgcatgtgacact 
gttcccacatacaaggctgacttctgaggattggagcaggctctggcggggaccagagct 
ctgcgtgctgctgctgccaccaagaagtgttagcagaagcagtagcagccaactggccct 
cctgactttggcccagagcacatgcgtggcttgctgaacccaggctcaggtttatcccca 
aggccccagctttgagaagggggaaggcccctggtaagttattgatgcccccatatttca 

20 gctactgctctctttccaaggccttgcatggaaaggcctagccattgtctgaggcagcaa 
tctttggcatctacaggtggcagcagcctttcaccagggctccatctgtgaagagtctca 
gccatgactttgagctgagcttgggagaagtaaagcaactgttaaggccagcccttgccc 
ctcagacctgccatgaaaggaatgagccctagactgactcctgcagcacccccgggacag 
gctgggaccagctgtttgtctccaggtgtcagagtccctcctcctcctccaacctctcca 

25 acctactttgtttggaaataccgagctacacttcaaaatgtattcaagggatttccaata 
aatttttttctgtaaaaaaaaaaaaaaaaaaa 

>ab014606 {TOXMARKER Assignment: 53/ SEQ ID NO: 143) 
cgctcccgccccagctcgcgctgcccgggcgggcgccggccgctggcgccgctactgctg 

30 ccgcccccggggcgcgagtccgccgcccgccgcccgggcacccggcgaggggcgggggca 
gctccgaaccggccccagatccttcccgcttccgcctcacgcttcccggaaagcttgtcc 
ctctccgccgagctgctccgggagccccgccgcgccgagggtatctcccagagccccagc 
tggtgtggccaggccccaggagtaggatggggctccccctacgagggccggtggcagcca 
gaactgatacagcccccctggtctggggccaggacgccagctgaggagggcaggagtgtc 

35 tggagctatggctggtgcctcggtgaaagtggcagtgagggttcggccctttaacgcccg 
tgagaccagccaggatgccaagtgtgtggtcagcatgcagggcaacaccacctccatcat 
caatcctaaacagagcaaggatgcccccaaaagcttcacctttgactactcctactggtc 
acacacttcgacggaggacccccagtttgcatctcagcagcaagtgtatcgggacattgg 
agaagagatgctgctccacgcctttgaaggctacaacgtgtgcatctttgcctatgggca 

40 gaccggggctgggaaatcctataccatgatggggcgacaggagccagggcagcagggcat 
cgtgccccagctctgtgaggacctcttctctcgcgttagtgagaaccagagtgctcagct 
atcctactctgtggaggtgagctatatggagatctactgtgagcgggtacgagacctctt 
gaaccccaagagtcggggttctctgcgggtccgggagcaccccatcctgggcccgtacgt 
gcaggacctgtccaaattggctgtgacctcctacgcagacattgctgacctcatggactg 

45 tggaaataaagcacggactgtggctgccaccaacatgaatgagaccagcagccgttccca 
tgccgtctttaccatcgtcttcacacagcgctgccatgaccagctcacggggctggactc 
ggagaaggtcagtaagatcagtttggtggaccttgctgggagtgagcgagccgactcctc 
aggggcccggggcatgcgcctgaaggaaggagccaacatcaataagtccctgactacact 
agggaaagtgatctcggcccttgcagatatgcaatcaaagaagcgaaagtcggattttat 

50 cccctacagggactctgtgctcacctggctgctcaaggaaaatttgggggggaactcacg 
cacagccatgattgcagccctgagccctgctgacatcaattacgaggagactctcagcac 
cctcaggtatgctgaccgcaccaagcaaatccgctgcaatgccatcatcaacgaggaccc 
taatgcccggctgattagagagctgcaggaggaagtagcccggctgcgggaactgctgat 
ggctcagggactgtcagcctctgctctggaaggcctgaagacggaagaagggagtgtcag 

55 aggcgccctgccagctgtgtcatctcccccagctccagtttcaccctcatcacccaccac 
acataatggggagctggagccgtcattctcccccaacacggagtcccagattgggcctga 
ggaagccatggagaggctgcaggagacagagaagattatagctgagctgaacgagacatg 
ggaggagaagctacgcaagacagaagccctgaggatggagagagaagcattgctggctga 
gatgggggtggccgtccgggaggatgggggaactgtgggcgtcttctctccaaagaagac 

60 tccccacctggtgaacctgaacgaagaccctctgatgtctgagtgtctgctctaccacat 
caaagatggcgtcaccagggtcggccaagtagatatggacatcaagctgaccggacagtt 
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cattcgggagcaacactgtctgttccggagcatcccccagccagatggagaagtggtggt 
cactctggagccttgtgaaggagctgagacatatgtgaatgggaagcttgtgacggagcc 
gctggtgctgaagtcagggaataggattgtgatgggcaagaaccacgttttccgcttcaa 
ccacccggagcaggcaaggctggaacgggaacgaggggtccccccacccccaggaccgcc 
ctctgagccagtcgactggaactttgcccagaaggaactgctggagcagcaaggcatcga 
cataaagctggaaatggagaagaggctgcaggatctggagaatcagtaccggaaagaaaa 
ggaagaagccgatcttctgctggagcagcagcgactgtatgcagactcggacagcgggga 
tgactctgacaagcgctcttgtgaagagagctggaggctcatctcctccttgcgggagca 
gctgccgcccaccacggtccagaccattgtcaaacgctgtggtctgcccagcagtggcaa 
gcgcagggcccctcgcagggtttatcagatcccccagcgacgcaggctgcagggcaaaga 
cccccgctgggccaccatggctgacctgaagatgcaggcggtgaaggagatctgctacga 
ggtggccctggctgacttccgccacgggcgggctgagattgaggccctggccgccctcaa 
gatgcgggagctgtgtcgcacctatggcaagccagacggccccggagacgcctggagggc 
tgtggcccgggatgtctgggacactgtaggcgaggaggaaggaggtggagctggcagtgg 
tggtggcagtgaggagggagcccgaggggcggaggtggaggacctccgggcccacatcga 
caagctgacggggattctgcaggaggtgaagctgcagaacagcagcaaggaccgggagct 
gcaggccctgcgggaccgcatgctccgcatggagagggtcatccccctggcccaggatca 
tgaggatgagaatgaagaaggtggtgaggtcccctgggccccgcctgaaggatcagaggc 
agcagaggaggcagcccccagtgaccgcatgccgtcagcccggcccccctcgccgccact 
gtcaagctgggagcgggtgtcacggctcatggaggaggaccctgccttccgtcgtggtcg 
tcttcgctggctcaagcaggagcagctacggctgcagggactgcagggctctgggggccg 
gggcggggggctgcgcaggcccccagcccgctttgtgccccctcacgactgcaagctacg 
cttccccttcaagagcaacccccagcaccgggagtcttggccagggatggggagcgggga 
ggctccaactccgctccaaccccctgaggaggtcactccccatccagccacccctgcccg 
ccggcctccgagtccccgaaggtcccaccatccccgcaggaactccctggatggaggggg 
ccgatcccggggagcgggttctgcacagcctgaaccccagcacttccagcccaaaaagca 
caactcttatccccagccaccccaaccctacccagcccagcggcccccagggccccgcta 
ccccccatacactactcccccacgaatgagacggcagcgttctgcccctgacctcaagga 
gagtggggcagctgtgtgagtcccacatcctgggcagagggcctggtggggccccttgct 
aggagaagggaagacgcccgagacgctgcttccccagaagtgctggggcagggaggccca 
ggagatgagagagaaggtccgagtaggtgatagaagacaagggggagaccgagccggagg 
ctgaggaaaggaagagggcacggagttgccaggagcaaaccaaagtgaagagagagatag 
gaagctgcctcggggccaccccttgcaaagggggtgtgtcccacaaacgctgctatgggt 
ggggtggggggctggggtgctgcgtagccagtgtttgactttcttttcaagtgggggaaa 
gtgggagaggactgagagtgaggcaagttctccccagcccctgtccgtctgtctgtctgt 
ctgtggtggtttctgtttcttgggaggcatggtaggatcataagtcattcccctcccctt 
ccaggcctcctgctatatttgggggacctgactggtttggctggagtcccatgaggatgt 
gggccctttaataaaggatagcaaacagggagcttgtggcctgtttgttttgggttttca 
tggaggtgtaggttatataaggcaatggcacaggtcttaagcatacttatcagtgaagta 
ttgtatgtgtgctctgtgcaggcaccacccagatctggatataagaatgtttccatcttg 
tcttcctgaacttcaccctcctgtctcttccttcaggttgcgcagcccgatcttttcccc 
gctttttttttttgggagacagggtcttgctttgttgcccaggctggaggtacagtcttg 
gctcactgcagcctccgcctcctgagtagctgggattacaggcatgtgccaccacgcccg 
gctcattactgttttttttgtagtgacgaggtttcgccgtgttggccaggctggtctcga 
actcctgatgacctcaagtgatccgcccgccttggcctcccagagtggtgggattgcaga 
gacagtgatcttgctatgctgcccaggctggtctcaaccttatgggctcaagtgatcctc 
ccacctcagcctcccaaagtgctaggattacctgcgtgagctacagcgccctgcctgttc 
tgggcttcttgcagagcctcttcagctgcagagaagcagctctcctttctccaagtccag 
agccaacaggacgaataatgaagctgttgggaagatttactgataatacatgtaaagggt 
ctagcacattttaggagctcaaggttggtgccttccctttttctttactctgaaccggat 
atgaggccttgagaaagaagagaggcgcttgcaaaacgaggtgaggtctcaggcacagtg 
gctcacgcctgtaatcccagcactttaggagaccgaggcgggcggatcatgaggtcagga 
gttcgagaccagcctggccaacatggtgaaagcccgtctctactaaaaatacaaaaatta 
gacgggcatggtggtgggcacctgtaatcccggctacttgggaggttgagggaggagaat 
cgcttgaacccaggaggtggaggttgcagtgagccgagactgcaccattgcactccagcc 
tgggcaatagagcgagactccgtctcaagcaagcaagcaagcaaacaaacaaaataaaaa 
acgaggtcaagtttcaaaagatgtcacccccaacctggcaaaacttctcctcaagccctg 
tcgttccactcttgtccgccaggaggagaaaaggttccctcgaaggacgtctttgcttgc 
gcgttcacggagccttgagaacgagtggccgaggggacccctgcggccctgcgcgcctaa 
99gaggacctgactcctttcagaagtagcatttcttccccttcgtgggtgctcttgagtt 
ccaaagaaaaggaagagaagccttcattgagcagcttcttctgccttagggactgtgcta 
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gggggtagatcgaccttaggggaaacaatccccgcttattagaggaggttttggatcagg 
gtttgctttatttgaaatttaacaaatacagaaaagcagaaggaagaaaattgaagtaat 
ccatgtttccactgggcgcggcggctcacgcctataatcccagcactttgggaggccaag 
gcgggcggatcacgaggtcaggagttcgagaccagcctgaccaacatggtgaaacccccc 
5 gtctctactaaaattacaaaattagccggtcgtggtggcacacgcctgtaatcccagcta 
ctcaggaggctgaggcaggagaatcgcttgaacccgggaggcagaggttgcagtgagccg 
agattgcaccactgcactccagcctgagcaacagagtgagactccgttgtctttaaaggc 
caatcccatagcaaatgacagagactcacttgagtaagaaaggtttttgacaagaaaacc 
cacagaagaagaggtaagctgtggatataagaaaggcactacaatctgtatttaaatata 
10 attgcacactagataatatatgtatgaaaaattattt 

>u78525 (TOXMARKER Assignment: 61; SEQ ID NO: 148) 

aattccggccgcgtcgacgggagagtcggtagcgcggcggccgcggagccctgcgagtag 

gcaagcgttgggcccatgcaggacgcggagaacgtggcggtgcccgaggcggccgaggag 

15 cgcgccgagcccggccagcagcagccggccgccgagccgccgccagccgaggggctgctg 
cggcccgcggggcccggcgctccggaggccgcggggaccgaggcctccagtgaggaggtg 
gggatcgcggaggccgggccggagcccgaggtgaggaccgagccggcggccgaggcagag 
gcggcctccggcccgtccgagtcgccctcgccgccggccgccgaggagctgcccgggtcg 
catgctgagccccctgtcccggcacagggcgaggccccaggagagcaggctcgggacgag 

20 cgctccgacagccgggcccaggcggtgtccgaggacgcgggaggaaacgagggcagagcg 
gccgaggccgaaccccgggcgctggagaacggcgacgcggacgagccctccttcagcgac 
cccgaggacttcgtggacgacgtgagcgaggaagaattactgggagatgtactcaaagat 
cggccccaggaagcagatggaatcgattcggtgattgtagtggacaatgtccctcaggtg 
ggacccgaccgacttgagaaactcaaaaatgtcatccacaagatcttttccaagtttggg 

25 aaaatcacaaatgatttttatcctgaagaggatgggaagacaaaagggtatattttcctg 
gagtacgcgtcccctgcccacgctgtggatgctgtgaagaacgccgacggctacaagctt 
gacaagcagcacacattccgggtcaacctctttacggattttgacaagtatatgacgatc 
agtgacgagtgggatattccagagaaacagcctttcaaagacctggggaacttacgttac 
tggcttgaagaggcagaatgcagagatcagtacagtgtgatttttgagagtggagaccgc 

30 acttccatattctggaatgacgtaaaagaccctgtctcaattgaagaaagagcgagatgg 
acagagacgtatgtgcgttggtctcctaagggcacctacctggctacctttcatcaaaga 
ggcattgctctatgggggggagagaaattcaagcaaattcagagattcagccaccaaggg 
gttcagcttattgacttctcaccttgtgaaaggtacctggtgacctttagccccctgatg 
gacacgcaggatgaccctcaggccataatcatctgggacatccttacggggcacaagaag 

35 aggggttttcactgtgagagctcagcccattggcctatttttaagtggagccatgatggc 
aaattctttgccagaatgaccctggatacgcttagcatctatgaaactccttctatgggt 
cttttggacaagaagagtttgaagatctctgggataaaagacttttcttggtctcctggt 
ggtaacataatcgccttctgggtgcctgaagacaaagatattccagccagggtaaccctg 
atgcagctccctaccaggcaagagatccgagtgaggaacctgttcaatgtggtggactgc 

40 aagctccattggcagaagaacggagactacttgtgtgtgaaagtagataggactccgaaa 
ggcacccagggtgttgtcacaaattttgaaattttccgaatgagggagaaacaggtacct 
gtggatgtggtcgagatgaaagaaaccatcatagcctttgcctgggaaccaaatggaagt 
aagtttgctgtgctgcacggagaggctccgcggatatctgtgtctttctaccacgtcaaa 
aacaacgggaagattgaactcatcaagatgttcgacaagcagcaggcgaacaccatcttc 

45 tggagcccccaaggacagttcgtggtgttggcgggcctgaggagtatgaacggtgcctta 
gcgtttgtggacacttcggactgcacggtcatgaacatcgcagagcactacatggcttcc 
gacgtcgaatgggatcctactgggcgctacgtcgtcacctctgtgtcctggtggagccat 
aaggtggacaacgcgtactggctgtggactttccagggacgcctcctgcagaagaacaac 
aaggaccgcttctgccagctgctgtggcggccccggcctcccacactcctgagccaggaa 

50 cagatcaagcaaattaaaaaggatctgaagaaatactctaagatctttgaacagaaggat 
cgtttgagtcagtccaaagcctcaaaggaattggtggagagaaggcgcaccatgatggaa 
gatttccggaagtaccggaaaatggcccaggagctctatatggagcagaaaaacgagcgc 
ctggagttgcgaggaggggtggacactgacgagctggacagcaacgtggacgactgggaa 
gaggagaccattgagttcttcgtcactgaagaaatcattcccctcgggaatcaggagtga 

55 cctggagcactgtggggacggactccgcctgctgttcccgcgctgagctacaggactccc 
gagtgtgagccgcggttcctctgttgcagcgcagccgtgtgtgctgtggagccgaggccg 
tcctgcaggaagccgcgtgactcccgcctcctccctgtgctctctggctctggactgtga 
ctgcgcctggattctgccattgcgacacatttttgtgcctttcagcccctggtgtctgca 
gtgggggatttaaggcacccgcttccacttctttcttgtttggagttttctgttggaacc 

60 gccggcgttggctccgaagacttagcgacgccactggcggcaccttctcctgcgcccagt 
gatgtttccacggtgcctgtacacagccgagcagcatttccgttgaaggacttgcatccc 
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cattgcgggcagtgctggacgtgtcccggagacccaccgggaggcgccgcatgccttgta 
cccccaccgtgcaggttgtggccggttttctccgcaggttgaacatggaaataaaagcaa 
acttgtatggaattcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

>bc014348 (TOXMARKER Assignment: 88; SEQ ID NO: 157) 

ggggagtgccgggcggtcggcgggtcagggcagcccggggcctgacgccatgtcccggaa 

cctgcgcaccgcgctcattttcggcggcttcatctccctgatcggcgccgccttctatcc 

catctacttccggcccctaatgagattggaggagtacaagaaggaacaagctataaatcg 

ggctggaattgttcaagaggatgtgcagccaccagggttaaaagtgtggtctgatccatt 

tggcaggaaatgagagggctgtcatcagctctgattaagaaaggagatttcttcatgctt 

tcgattctgcatggggtacagccagtcacctcaccagagaatgacggctggagaagaaaa 

ctctgtaataccataaataagagtgcttgtaataaaagactgtgcacaaggattaatatt 

tcccttcttaagtatcaaaagaactctggaacaaattataccattaggaaggttttcatg 

attcagttgattttccaaaaatgaagctatctcacccagctgggtttggaggagcaatct 

gcttattattctgtcgttaccacttactcaagcgagctgtgatatgaatacaagcaacca 

gtgggctcgggaaggtccgggtctcttctgccatcttccagataagagatttcagtaaaa 

aactgccatgctgagctgccttatagagctcttcgaaaatgttcgagttgataaagctct 

ttgaggacaaggtacttcgtgcacctcatgctgaagattgcaccgtgttggaaaataaat 

atgaagcaagtcaaactagatgcatacacttgtgtagaaatcaataatcaattaatagaa 

gtgaaaaaatagacattaaaatgatttatttcaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aa 

>m94151 (TOXMARKER Assignment: 89; SEQ ID NO: 165) 

gggctgcaggaattcccccacagagggagcatgacttcggcaacttcacctatcattctg 

aaatgggaccccaaaagtttggaaatccggacgctaacagtggaaaggctgttggagcca 

cttgttacacaggtgactacacttgtcaacacaagcaacaaaggcccatctggtaaaaag 

aaagggaggtcaaagaaagcccatgtactagctgcctctgtagagcaagccactcagaat 

ttcctggaaaagggtgaacagatcgctaaggagagtcaagatctcaaagaagagttggtg 

gctgctgtagaggatgtgcgcaaacaaggtgagacgatgcggatcgcctcctccgagttt 

gcagatgacccttgctcgtcggtaaagcgcggcaccatggtacgggcggcaagggctttg 

ctctccgcggtgacacgcttactcatcctggcggacatggcagatgtcatgagactttta 

tcccatctgaaaattgtggaagaggccctggaagctgtcaaaaatgctacaaatgagcaa 

gaccttgcaaaccgttttaaagagtttgggaaaaagatggtgaaacttaactatgtagca 

gcaagaagacaacaggagctgaaggatcctcactgtcgggatgagatggcagccgcccga 

ggggctctgaagaagaatgccacaatgctgtacacggcctctcaagcatttctccgccac 

ccagatgtcgccgctacgagagccaaccgagattatgtgttcaaacaagtccaggaggcc 

atcgccggcatctccaatgctgctcaagctacctcgcccactgacgaagccaagggccac 

acgggcatcggcgagctggctgcggctcttaatgagtttgacaataagattatcctggac 

cccatgacgttcagcgaggccaggttccggccgtccctggaggagaggctggagagcatc 

atcagcggcgcagcgctgatggccgactcctcctgcacgcgagacgaccggcgcgagagg 

atcgtggcggagtgcaacgccgtgcggcaggcgctccaggacctgctcagcgagtacatg 

aataatactggaaggaaagaaaaaggagatcctctcaacattgcgattgataagatgact 

aagaaaacaagagatctaaggagacagcttcggaaagcagtgatggatcacatatctgac 

tctttcctggaaaccaatgttcctttgctagttctcattgaggctgcaaagagcggaaat 

gaaaaggaagtgaaagaatatgcccaagttttccgtgagcatgccaacaaactggtagag 

gttgccaatttggcctgttccatctccaacaatgaagaaggggtgaaattagttcggatg 

gcagccacccagattgacagcctgtgtccccaggtcatcaatgccgctctgacactggct 

gcccggccacagagcaaagttgctcaggataacatggacgtcttcaaagaccagtgggag 

aagcaggtccgagtgttgacagaggccgtggatgacatcacctcagtggatgacttcctc 

tctgtctcagaaaatcacatcttggaggatgtgaacaagtgtgtgatagccctccaagag 

ggcgatgtggacactctggaccggactgcaggggccatcaggggccgggcagctcgagtc 

atacacatcatcaatgctgagatggagaactatgaagctggggtttatactgagaaggtg 

ttggaagctacaaaattgctttctgaaacagtgatgccacgcttcgctgaacaagtagag 

gttgccattgaagccctgagtgccaacgttcctcaaccgtttgaggagaatgagttcatc 

gatgcctctcgcctggtgtatgatggcgttcgggacatcagaaaggctgtgctgatgatc 

aggaccccagaagaactagaggatgattctgactttgagcaggaagattatgatgtgcgt 

agagggacaagtgttcagactgaggatgaccagctcattgcagggcagagcgcacgggcc 

atcatggcgcaactaccgcaggaggagaaggcaaaaatagctgagcaggtggagatattc 

catcaagagaaaagcaagctggatgcagaagtggccaaatgggacgacagcggcaatgat 

atcattgtactggccaagcagatgtgtatgatcatgatggaaatgacagacttcacaaga 
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ggcaaaggcccattgaaaaatacatctgatgtcattaatgctgccaagaaaattgccgaa 
gcaggttctcgaatggacaaattagctcgtgctgtggctgatcagtgtcctgattcagca 
tgtaagcaggatttattagcctaccttcaacgaattgccttgtattgccatcagcttaat 
atctgcagcaaggtgaaggcagaagtgcagaatctgggaggagagctcattgtgtcaggg 
acaggagttcagagcactttcactaccttttatgaggtagattgtgatgtcatagatggg 
ggcagggctagtcaactttctacccacctcccaacctgtgctgagggagctccgatcggg 
agtggaagcagtgattcctccatgctggacagtgccacatcgcttatccaggcagctaaa 
aacctgatgaatgctgttgtcctcacggtgaaagcatcctatgtggcctcaaccaaatac 
cagaaggtctatgggacagcagctgtcaactcacctgttgtgtcttggaagatgaaggct 
ccagagaagaagccccttgtgaagagagaaaagcctgaagaattccagacacgagttcga 
cgaggttctcagaagaaacacatttcgcctgtacaggctttaagtgaattcaaagcaatg 
gattccttctaggacgataggttttaacaagaaagctttttctttcttttctttctttct 
ttttctttttaattccatttttgtatgcatacctgccagctcgtatgcctctggcatggg 
gaaattaagggaacagtgtctgtttgcatgtaagatgagatgagatcaatactactgatc 
catctgtagcctgggaaggagacaggacattcctgtactaaggtggcacagagctgtcct 
ttgcaacattctcataaaattgggcacagagttcgcattggcgcaatatttatgggagtg 
99 a 999 at 9999 aaaa t aaa cttaactctacaaaagcaaactctaatgcatgcaagaatc 
attaggttggcaggtatatgcataagtgaaaaatctggaagtgtaatggtagaacataaa 
acttgtattgcttctgtttcagtgcaaaaatgtactagccaatacgcttaagtgtgtggc 
ccatgaattgaacaatttaaccttgaagtctatatccgtgatattatgtcgatttttaac 
tgaggggaaattaactagtccagcctaaaatgcttcttttaatctgcattctgtttcctc 
ttctagttgtgccattactagtgatcatgtttttttcccccctttaatgaaaacaataaa 
catctatttgagacaattaaaatccttctgggggcactggaagcacaatacggtgaccaa 
tcttgctttcatttttttttctttttaatttgaaccatgattttgctagaaatagaaggc 
ccagtggtggaatattagagggaaggaaactgacaacgtgtgaaagtta 

>nm_000521 (TOXMARKER Assignment: 38; SEQ ID NO: 130) 

ctgatccgggccgggcgggaagtcgggtcccgaggctccggctcggcagaccgggcggaa 

agcagccgagcggccatggagctgtgcgggctggggctgccccggccgcccatgctgctg 

gcgctgctgttggcgacactgctggcggcgatgttggcgctgctgactcaggtggcgctg 

gtggtgcaggtggcggaggcggctcgggccccgagcgtctcggccaagccggggccggcg 

ctgtggcccctgccgctctcggtgaagatgaccccgaacctgctgcatctcgccccggag 

aacttctacatcagccacagccccaattccacggcgggcccctcctgcaccctgctggag 

gaagcgtttcgacgatatcatggctatatttttggtttctacaagtggcatcatgaacct 

gctgaattccaggctaaaacccaggttcagcaacttcttgtctcaatcacccttcagtca 

gagtgtgatgctttccccaacatatcttcagatgagtcttatactttacttgtgaaagaa 

ccagtggctgtccttaaggccaacagagtttggggagcattacgaggtttagagaccttt 

agccagttagtttatcaagattcttatggaactttcaccatcaatgaatccaccattatt 

gattctccaaggttttctcacagaggaattttgattgatacatccagacattatctgcca 

gttaagattattcttaaaactctggatgccatggcttttaataagtttaatgttcttcac 

tggcacatagttgatgaccagtctttcccatatcagagcatcacttttcctgagttaagc 

aataaaggaagctattctttgtctcatgtttatacaccaaatgatgtccgtatggtgatt 

gaatatgccagattacgaggaattcgagtcctgccagaatttgatacccctgggcataca 

ctatcttggggaaaaggtcagaaagacctcctgactccatgttacagtagacaaaacaag 

ttggactcttttggacctataaaccctactctgaatacaacatacagcttccttactaca 

tttttcaaagaaattagtgaggtgtttccagatcaattcattcatttgggaggagatgaa 

gtggaatttaaatgttgggaatcaaatccaaaaattcaagatttcatgaggcaaaaaggc 

tttggcacagattttaagaaactagaatctttctacattcaaaaggttttggatattatt 

gcaaccataaacaagggatccattgtctggcaggaggtttttgatgataaagcaaagctt 

gcgccgggcacaatagttgaagtatggaaagacagcgcatatcctgaggaactcagtaga 

gtcacagcatctggcttccctgtaatcctttctgctccttggtacttagatttgattagc 

tatggacaagattggaggaaatactataaagtggaacctcttgattttggcggtactcag 

aaacagaaacaacttttcattggtggagaagcttgtctatggggagaatatgtggatgca 

actaacctcactccaagattatggcctcgggcaagtgctgttggtgagagactctggagt 

tccaaagatgtcagagatatggatgacgcctatgacagactgacaaggcaccgctgcagg 

atggtcgaacgtggaatagctgcacaacctctttatgctggatattgtaaccatgagaac 

atgtaaaaaatggaggggaaaaaggccacagcaatctgtactacaatcaactttattttg 

aaatcatgtaaaataagatattagacttttttgaataaaatatttttattgattgaa 

>ab007887 (TOXMARKER Assignment: 91; SEQ ID NO: 158) 
gtcagatcagggatcattttttttccttcctctactccctcccccctacccgcccctccc 
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tccctgtttcccttccctccctccctcccctctctgctgggtctgtgcgctggggcgccc 

gatcccctccgcagctgggacgctccgaactcgaggcaggagtcggctctccggagcctc 

gtccctcccttccccttccctgcccccttcccccacccccgactcgggcttggcgcggcg 

gccagaggaaccccgagtcccggcccaggcccctgagctggagggatggaaaactcctct 

gcagcatcagcctcctcggaggcagggagcagccgctcccaggagatcgaggagctggag 

cgcttcatcgacagctacgtgctggagtaccaggtgcaggggctgctggctgacaagacg 

gagggtgatggcgagagcgagaggacccagtcccacatctcccagtggacagcggactgc 

agcgaaccgctggacagcagctgttccttctcccgagggcgagcccccccacagcagaat 

ggcagcaaagacaactctctggacatgctgggcacggacatctgggcggccaacaccttc 

gattccttcagtggtgccacctgggacctgcagccggaaaagctggacttcacccagttc 

caccgcaaagtccgacacacgcccaagcagcccctgccacacatcgaccgcgaagggtgt 

ggcaaagggaagctggaagatggggatggcatcaacctgaatgacatcgagaaggtcctt 

ccagcctggcagggctaccacccgatgccccatgaagtggagatcgcacacaccaagaag 

ctgttccgcaggaggagaaatgatcgaaggcggcagcagagacctccggggggcaacaag 

ccccaacagcatggtgaccaccagccaggcagtgccaaacacaacagggaccaccagaaa 

tcctaccaggggggctcagcaccccacccctcagggaggcccactcaccatggctacagc 

cagaaccggcgctggcaccatggcaacatgaagcacccaccaggcgacaagggggaggca 

ggcgcacaccgcaatgccaaagagaccatgaccatcgagaacccaaaactggaggacact 

gcaggggacaccgggcacagcagcctcgaggccccccgcagccctgacaccctggccccg 

gtggcttctgagcggctgcccccacagcagtcaggggggccagaggttgagacaaaacgt 

aaagacagtattcttcccgagcgcatcggggagcggcccaaaattaccctgctccagtct 

tccaaagacagactgcggcgaaggctaaaggaaaaggatgaagtggccgtggagacgacc 

actccccagcagaacaagatggacaagctgatcgagatcctgaacagcatgcggaacaac 

agcagcgacgtggacaccaagctcaccaccttcatggaggaggcccagaactccaccaac 

tccgaggagatgctgggcgagatcgtgcgcacaatctaccagaaggctgtgtccgaccgc 

agcttcgccttcaccgctgccaagctctgcgacaagatggcgctctttatggtggagggg 

accaagttccggagcctgctcctcaacatgctgcagaaggacttcacggtgcgcgaggag 

ctgcagcagcaggacgtggagcgctggctgggcttcatcaccttcctgtgtgaggtcttc 

ggcaccatgcgcagcagcacaggcgagcccttccgtgtgctcgtgtgccccatctacacc 

tgcctcagggagctcttgcaatctcaggatgtgaaggaagatgctgtcctttgctgctct 

atggagctgcagagtacaggccggctgctggaggaacagctgcctgagatgatgacagag 

ctcctggccagcgcacgggacaagatgctgtgcccctcggagtccatgctgacccggtcg 

ctgctcctagaggtcatcgagctccacgctaacagctggaaccctctgacgccccccatc 

acgcagtactacaacagaaccatccagaaactgacagcctgacagccagggggcctggca 

ggcggcccacgggcagctggggccctggtgcacagggccagatggacaggcgggaggaca 

99ggtggccctggcgggagaaagaaatggggaggagggcaggcagagtcggtggccagtc 

tggagccagacggggaagggagcaaatccctgagaggagtgcccccgcacaagcccccca 

gcccgagcatgcaagctcacaccaataagggaagcatgtttctttttcctggtggccctg 

gccctccccttcctcactcccgcctctcccctccccatcagacccatcccccacggagct 

ttgtgtgagggatctcatcgctgtgactcctcggagaccttggcagcctcgcacgccggg 

gcaccgcttgggtcagaaaggacctcggaaggctgaaaaagtgggtcggagacgggctcg 

cattgttcccgcatgctgtcagccgcagtcgccaactggcagcaggcgacgtgtagcaga 

tgtccgggaggacaaaggcaggcacggtccccaccagccgcccgtaattgacggcctttg 

tcagccatggcagagctgacgctccacctcccacctccaagtcctcctcactgcagcccc 

cacagcctcaggcctagggggtcaggcgcagcgggggagatggagtttgcagttccactt 

gcactcttttgtttattgtgttttatttttcaaaagtcggttgctttgaagtctctttgg 

ccaatgaaaatgcccgtgaggtgatcacacagtcagcactgttgaggacccccggattag 

tgggagatcaaacccagctcccctctagaagaaggattcgagccacagacagcttgccag 

tagccaattagggtaattggaaacttctgccccggcggggggtccccgctggaatcctgt 

gttcctcgccactggcttccagcgcctctgttttctcaaagggctgatactgtcaccact 

gggaccaagttaaacctggtcctggccccaggggccttgtggcaaacagggcacagaacg 

agactggcaaattaaaaccaaaattctagatggtgtcttgcgctccacacgcaggtctta 

ctggggaaaaggatgggagtgggggctccccaggactcgattttagctaatgcgctgtgt 

cactgccccagctcggacgtagaagcccagccctccgtgagctcttgggaaaggggtgaa 

ttcactgggtcatggaagggacagtcaggtgaccagcggggtcgccagatgaagcttccc 

agccgggaaacaagacggggtttcttggcaggccctggtcctggggagcaggccctgttg 

ttggctggagaggaaggtgtggggtggaacaggtgtccacatagctccatctctgggggc 

tggagcacacactttgatgagcccccccggaaatgatgtcagagcctagccgcttcctta 

tttgctcttttattgaggccgggcaggccctgggtcactttggaggcccctcttggtcca 

cactggactggccgggaggtgatgggcggggaaggttctcgtgattgattgattctgagt 

ctgagagtggcgagtggggagaggcttccccagttctctccagctttccctgcagctgca 
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acctgccctctggtcccaggtgtggagcctttgcctgtctctaaaaagagcctgttggcg 
acaaggtgtagggggcacaagtttacctgaaacaggtcagtggtctctcccaagaagcgc 
acgccacctctggtccctggccctgaaccctgccttcttcctccctccacggtttcttcc 
cagactttctcaagctcctcctcactgcccttcctccccagcccagcctgggaacacaga 
tgccccgcgggtaggaggcctcgagggaggagccgggctgatgcggggctgctcagggca 
ggccccagggcgagcttgccatcgtggccaggcagcctccacctgtgcttcagtggcccc 
tgcccccctgaagcatgtggggtttgtccgctaggaggaggcaaggcccccgaagagagg 
agagacctgggagtgggagctcaggtcagggaggaggcaggggagtggggtctcccagac 
ccaacggtgagctcagagcaagcttcacgcaggacgctccgaaacactgtgtggaggggg 
ctgtgttgtgggcaccttggggcctgattctccttcctccgaacgggctccttgatggcc 
tggccacaggggcagctccccattggctgttaggaccagagtgtgaagaagaagtgaaat 
ataaatatgtatacatatataaatatatttttaattacatgtcgtgtcacggtggctcca 
gacatactgtttgcctagtttattccactgcttgaaagcgcttcctagccaatctgaaca 
acaacactttaagctgtttttctaaatgcaggttgctgctcctttttcagatatggaagg 
aaaacgttaagactattttttttttaaagaaacaacagtcaagcctaaaatttgagaccc 
cgaggcagcttcccgagggagactgctcagacaggaactgcaggacagaagtggatgccc 
cacagaccctggccccctccccaagtccatcccctctctgtggcatgaggaaggccgcgt 
ccgagttgacctctgaatgtatgtgatgagaggcagagctggatattgcatttctaaggc 
ttgcattgctttcccctcgcccgcggttcttggcgcatggaagaggcggtccagccatct 
gatgttgatcctgtctcagtctccccactgcctgtcaggatgagttagtcattgtttttc 
tccgaggcggcctgcttgccacagccctgctccccaaggcctggtggctttgccgaagct 
ctgggaccgcagccccagcgaggcccccaacctcacccagacgaggccaggagccccgcc 
accctccacgggatgtgcaccctcagaccccattctctctgttcgtccttccttgaccag 
tctgtaaaccttcactgtttggggatcgtcctgtccatccatgtaaatgtaaatgttggc 
cgagtcggtatttattctgattgatttttattttattctattattttctccgagggatga 
gggtggggggtgtgggaagggtaccacagatcaggccggggcagctgtaggggcgggggc 
ccagacagccaggccgccaccagagcagccccatggggtgccccagacgcgggcctccaa 
gaagccaagtcccagtctgttttctggcatcagacaccggcccgtgttccttgtcagaca 
gacagactctcaggcctgcctggggagtcgtgtccctcagctgcagggcactgtgttggg 
aaaccattggctgggcctttgaggacacagatcagaagaaagaaagacaactttcctctg 
cgcggaacactcacacggaagggctggccgcctccctgagccggctgggagtggacgaca 
ggacctacctccccagagcaagggcctggggcttcccgccaaagctgccgcggaaccccg 
ctagtgcgaccaccctccctccgtcggtatgtcctgctttccagctgaacccaaactaca 
a gtgggtttaaaaaaaataaacaccaccaccaaaaac 

>ab021179 ( TOXMARKER Assignment: 65; SEQ ID NO: 150) 
gcaggttgggagggaaagtcgggggaggacgcggaagaggagctgtgggaagggggagga 

gggagggaggaaaagaggaggagacggaggagaactgagcagagcagagcatcgagccaa 
a 9ggg a 9 a tgagtttgtctgtcctctgctgaggctacggccgggcctagggaactgggag 
cttgggtggaagcgacacccgtggaagtgggaggaggtggcgccgggactttaacccctt 
gtgggctctgcggcaggggatttaaccctttgtggatctggcccctcggaggcagcgtca 
tcggtagttttaaccccttcggggctgggtttcacgcactggacttaccctcatcacctt 
gctcaccaactcctttattggggtgctccgcttggaggtttgaggcccacctccgcccat 
tacgtactgttcctgccgctgcacccccttggacccgctagctggccgcactgtgggcgc 
ttaaccctttactgacttgagctccccagattgcagttggagtttgctgatagaaggact 
agctaaaggcgtcactgcaggaattacaaactgaagaggactctgttggactgttttttt 
tttctttttcttttttttaagaaaaacccatttttttccttaaggacttactagccaaaa 
tttcttaaacttcgaggactctactagccatggccgagccattcttgtcagaatatcaac 
accagcctcaaactagcaactgtacaggtgctgctgctgtccaggaagagctgaaccctg 
agcgccccccaggcgcggaggagcgggtgcccgaggaggacagtaggtggcaatcgagag 
cgttcccccagttgggtggccgtccggggccggagggggaagggagcctggaatcccaac 
cacctcccttgcagacccaggcctgtccagaatctagctgcctgagagagggcgagaagg 
gccagaatggggacgactcgtccgctggcggcgacttcccgccgccggcagaagtggaac 
cgacgcccgaggccgagctgctcgcccagccttgtcatgactccgaggccagtaagttgg 
gggctcctgccgcagggggcgaagaggagtggggacagcagcagagacagctggggaaga 
aaaaacataggagacgcccgtccaagaagaagcggcattggaaaccgtactacaagctga 
cctgggaagagaagaaaaagttcgacgagaaacagagccttcgagcttcaaggatccgag 
ccgagatgttcgccaagggccagccggtcgcgccctataacaccacgcagttcctcatgg 
atgatcacgaccaggaggagccggatctcaaaaccggcctgtactccaagcgggccgccg 
ccaaatccgacgacaccagcgatgacgacttcatggaagaagggggtgaggaggatgggg 
gcagcgatgggatgggaggggacggcagcgagtttctgcagcgggacttctcggagacgt 
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acgagcggtaccacacggagagcctgcagaacatgagcaagcaggagctcatcaaggagt 

acctggaactggagaagtgcctctcgcgcatggaggacgagaacaaccggctgcggctgg 

agagcaagcggctgggtggcgacgacgcgcgtgtgcgggagctggagctggagctggacc 

ggctgcgcgccgagaacctccagctgctgaccgagaacgaactgcaccggcagcaggagc 

gagcgccgctttccaagtttggagactagactgaaacttttttgggggagggggcaaagg 

ggactttttacagtgatggaatgtaacattatatacatgtgtatataagacagtggacct 

ttttatgacacataatcagaagagaaatccccctggctttggttggtttcgtaaatttag 

ctatatgtagcttgcgtgctttctcctgttcttttaattatgtgaaactgaagagttgct 

tttcttgttttcctttttagaagtttttttccttaatgtgaaagtaatttgaccaagtta 

taatgcatttttgtttttaacaaatcccctccttaaacggagctataaggtggccaaatc 

tgagaacaattaaattcattttagttataataaatttaatatttgtaaatgtaacatagt 

ttcagtgtgatttctagagctaattcaaaatagtattgatatattttatgtgactgcatt 

tttggggaggggtaccgaaatcgttaaatttgtcagtttgcaaaaatatcaatctttaat 

gggagaattttcaatttgccaattttttccttgaatgggtttaagtatgctacaatatac 

agttcaggcaaaatttaagatgtaattatcttcaatacttaagtgtgcttgctttctagt 

gccttggttttctttcttgatgctggaaaaataaacaaaccggtattgagtgtttaggcg 

agtggaaagtggctacaatccaaaattttaaatttaactctgcctcggccattcaaaagt 

ctaataacaaaaaatgtaaacctaatttggcagtttgttaggttagacaactgacagcct 

catttcattcctacaagttggttttcagtaatctcttccttccccccagtaaggctggaa 

gaggctcttggcaaacttcttagcgcaagcaatggttagattaatttgtgaggcagctct 

ttaagacgttcagaggtaagaaatactggatttataaagcaaatggctgtttgggggatt 

ccaaggatttacctaattgtccaattctacgtgctctctataccaaaacaaaaaaaaagc 

tatccacctttccatgtgggtcaaactaaaattagaaatgtcccctcactgcagatcaaa 

tgtaaagcttccagttaaggagctaaatgaggtcctcagctgaatgaggaaccctgtaca 

tccccttgcacagccctattctaaatcgcttaaactatgctgatagctgcttaggttctt 

gagtagttctgctcttaaacgtagggaggccctgagaactaaattttgccccaaaataaa 

aacagaaattatgagattgcctcctgtcattttggttaacccagtccttcacctgccctg 

tgtcagtgtcttctgagggcaattgcgttgctcaaatcactagcacagaggttccttaat 

ttggggccttagaaaccattgtgggccttggggtccatgaaccccatgaaattatttgta 

gacttgtatgtacatttttctggggagaaggttcaagagattcataagattgtcaaactc 

cttgaaggttcagaacctctgcagggaagggggaagaaaaccctcccattaggaagcatg 

cttttgcagttaaatggcgatggtggaggtgatagggacttcaagagtaaaatgcacctt 

gtattgcataagaagcatacacaaatcaataaatcaagggagattataccagtaggactg 

aatcagggccttcaaagctggactgagttggtcctgttctggcacatatggtccactgga 

gacaatgtatgattgagcttttctttggtctaaaaattatattaaacatttattttgaaa 

aaaaaaaaaaaaaaaaaaaaaaaa 

>141944 (TOXMARKER Assignment: 42; SEQ ID NO: 135) 

cccgcactaaagacgcttcttcccggcgggtaggaatcccgccggcgagccgaacagttc 

cccgagcgcagcccgcggaccaccacccggccgcacgggccgcttttgtcccccgcccgc 

cgcttctgtccgagaggccgcccgcgaggcgcatcctgaccgcgagcgtcgggtcccaga 

gccgggcgcggctggggcccgaggctagcatctctcgggagccgcaaggcgagagctgca 

aagtttaattagacacttcagaattttgatcacctaatgttgatttcagatgtaaaagtc 

aagagaagactctaaaaatagcaaagatgcttttgagccagaatgccttcatcttcagat 

cacttaatttggttctcatggtgtatatcagcctcgtgtttggtatttcatatgattcgc 

ctgattacacagatgaatcttgcactttcaagatatcattgcgaaatttccggtccatct 

tatcatgggaattaaaaaaccactccattgtaccaactcactatacattgctgtatacaa 

tcatgagtaaaccagaagatttgaaggtggttaagaactgtgcaaataccacaagatcat 

tttgtgacctcacagatgagtggagaagcacacacgaggcctatgtcaccgtcctagaag 

gattcagcgggaacacaacgttgttcagttgctcacacaatttctggctggccatagaca 

tgtcttttgaaccaccagagtttgagattgttggttttaccaaccacattaatgtgatgg 

tgaaatttccatctattgttgaggaagaattacagtttgatttatctctcgtcattgaag 

aacagtcagagggaattgttaagaagcataaacccgaaataaaaggaaacatgagtggaa 

atttcacctatatcattgacaagttaattccaaacacgaactactgtgtatctgtttatt 

tagagcacagtgatgagcaagcagtaataaagtctcccttaaaatgcaccctccttccac 

ctggccaggaatcagaatcagcagaatctgccaaaataggaggaataattactgtgtttt 

tgatagcattggtcttgacaagcaccatagtgacactgaaatggattggttatatatgct 

taagaaatagcctccccaaagtcttgaggcaaggtctcgctaagggctggaatgcagtgg 

ctattcacaggtgcagtcataatgcactacagtctgaaactcctgagctcaaacagtcgt 

cctgcctaagcttccccagtagctgggattacaagcgtgcatccctgtgccccagtgatt 

aagttttattatgtagaaaataaagagcaaacagtacagctgatatggactctctctctc 
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tttttttttttttttaagaattttcataactttttagcctggccatttcctaacctgcca 
ccgttggaagccatggatatggtggaggtcatttacatcaacagaaagaagaaagtgtgg 
gattataattatgatgatgaaagtgatagcgatactgaggcagcgcccaggacaagtggc 
ggtggctataccatgcatggactgactgtcaggcctctgggtcaggcctctgccacctct 
acagaatcccagttgatagacccggagtccgaggaggagcctgacctgcctgaggttgat 
gtggagctccccacgatgccaaaggacagccctcagcagttggaactcttgagtgggccc 
tgtgagaggagaaagagtccactccaggacccttttcccgaagaggactacagctccacg 
gaggggtctgggggcagaattaccttcaatgtggacttaaactctgtgtttttgagagtt 
cttgatgacgaggacagtgacgacttagaagcccctctgatgc tat cgtc teat ctggaa 
gagatggttgacccagaggatcctgataatgtgcaatcaaaccatttgctggccagcggg 
gaagggacacagccaacctttcccagcccctcttcagagggcctgtggtccgaagatgct 
ccatctgatcaaagtgacacttctgagtcagatgttgaccttggggatggttatataatg 
agatgactccaaaactattgaatgaacttggacagacaagcacctacagggttctttgtc 
tctgcatcctaacttgctgccttatcgtctgcaagtgttctccaagggaaggaggaggaa 
actgtggtgttcctttcttccaggtgacatcacctatgcacattcccagtatggggacca 
tagtatcattcagtgcattgtttacatattcaaagtggtgcactttgaaggaagcacatg 
tgcacctttcctttacactaatgcacttaggatgtttctgcatcatgtctaccagggagc 
agggttccccacagtttcagaggtggtccaggaccctatgatatttctcttctttcgttc 
ttttttttttttttttgagacagagtctcgttctgtcgcccaagctggagcgcaatggtg 
tgatcttggctcactgcaacatccgcctcccgggttcaggtgattctcctgcctcagcct 
ccctcgcaagtagctgggattacaggcgcctgccaccatgcctagcaaatttttgtattt 
ttagtggagacaggattttaccatgttggccaggctggtctcgaactcctgacctcaagt 
gatctgccctcctcagcctcgtaaagtgctgggattacaggggtgagccgctgtgcctgg 
ctggccctgtgatatttctgtgaaataaattgggccagggtgggagcagggaaagaaaag 
gaaaatagtagcaagagctgcaaagcaggcaggaagggaggaggagagccaggtgagcag 
tggagagaaggggggccctgcacaaggaaacagggaagagccatcgaagtttcagtcggt 
gagccttgggcacctcacccatgtcacatcctgtctcctgcaattggaattccaccttgt 
ccagccctccccagttaaagtggggaagacagactttaggatcacgtgtgtgactaatac 
agaaaggaaacatggcgtcggggagagggataaaacctgaatgccatattttaagttaaa 
aaaaaaaaa 

>bc015897 (TOXMARKER Assignment: 93; SEQ ID NO: 159) 

gccttctaaagcctctgaatgcaattacatgtatttcagaacattctaaagaagtaataa 

atcatcatccagatgtacaaacaaaagatgataagctcaaaaactcagttttggcccaag 

gtcctggtgctaccagttcagctgcaaatacctgtaaggtacagccacttactcttaaag 

agactgctgaaagttttggaagcccaccaaaagaagaaatgggaaatgaacacatcagtg 

tccaccctgaaaactcagactgtatccaagcagatgttaactctgatgattacaagggtg 

ataaagtataccatccagaaacaggaaggaaaaacgagaaagagaaagttggaaggaagg 

gcaagcatctgttgactgttgatcagaaacgtggagagcatgttgtctgtggcagcacac 

gtaataatgagtcagagagcactttggatttagaaggcttccaaaatcccacagctaaag 

agtgtgagggacttgccactttagataacaaggctgatcttgatggagaaagtacagaag 

gtactgaggaactagaagactctctaaaccactttacccactcagtacagggccagacat 

cagaaatgattccctctgatgaagaggaggaggacgacgaagaagaggaggaggaagaag 

aacctaggctcaccattaaccaaagggaagatgaagatggcatggctaatgaagatgagt 

tagacaacacctacactgggtcaggggatgaggacgccctatctgaagaggatgatgagt 

taggcgaagctgctaagtatgaagacgtgaaagaatgtggaaaacatgtagaaagagctc 

tcctagtggaacttaataaaataagtctcaaggaagaaaatgtatgtgaagaaaaaaatt 

cacctgtggatcagtctgattttttttatgaattcagtaaacttatcttcaccaaaggca 

agtctcctacggtagtgtgcagcttatgcaaacgagagggtcatctaaagaaggactgtc 

ctgaagacttcaaaagaatccagctagaacctctgccaccattaacacccaagtttttaa 

atatcttagatcaagtctgtatccagtgttataaggatttttctccaacaattatagaag 

atcaggctcgtgaacatattcggcaaaacctagaaagtttcataagacaggactttccag 

gaactaaattgagcctgtttggctcctccaaaaatggatttgggttcaaacagagtgacc 

ttgacgtctgtatgacaattaatggacttgaaactgctgagggattggactgtgtcagaa 

ctattgaagaattagcaagagtcctcagaaaacattcaggtctgagaaacatcttaccta 

ttacaacagcaaaggtgccaattgtgaagttcttccatttgagaagtggtctggaagtag 

atatcagtttgtataacacattggcccttcataacacaaggcttttatctgcttattccg 

ccattgatcccagagtgaagtatttgtgctataccatgaaagtatttacaaagatgtgtg 

atattggtgatgcatctagaggcagcttatcatcgtatgcatatactcttatggtgctat 

attttctccagcagaggaatccaccagtcattcctgtccttcaagagatatacaaaggtg 

aaaagaaacctgaaatatttgttgatggctggaatatttatttttttgatcaaatagatg 
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aactgcctacctattggtcagaatgtggaaaaaatacagaatctgttgggcagttatggt 
tgggccttcttcgtttctacacagaggaatttgattttaaagaacatgttattagcatca 
ggagaaaaagtctgcttacaacttttaagaaacagtggacctcaaaatacattgttattg 
aagatccctttgatttgaatcataatcttggagctggattatcaaggaaaatgacaaatt 
ttataatgaaggcttttatcaatggtagaagagtatttggtattcctgtcaagggatttc 
caaaggactacccctcaaaaatggaatacttttttgatccagatgtgttaactgaaggag 
agctggccccaaatgatagatgttgtcgaatttgtggaaaaatcggacacttcatgaagg 
actgtcctatgaggagaaaagtaagacggcggcgagatcaggaagatgccctgaaccaaa 
gataccctgagaacaaggaaaaaagaagcaaagaggacaaagaaattcacaacaagtaca 
cagaaagggaggtgtcaacaaaagaagataagcccatacagtgcacacctcagaaagcca 
agccaatgcgggcagctgctgacctggggagggagaagatcctcaggccaccagtagaaa 
aatggaagagacaggatgacaaagacttaagagaaaaacgttgttttatttgtggaagag 
aagggcacattaaaaaggaatgcccacagtttaaaggctcttcaggtagcctttccagta 
aatatatgactcagggaaaagcctcagcgaagaggacccagcaggaatcatgagggaagg 
aaaatgcagcactctaaatggccactcaggcgttcctattcactcggaaaattaggttca 
tttcacaggacacagcagtgtagatcaggcttcaacttaacatttaagggaaatgtcaga 
tttttttttaatttaatgaaattgttaatgaggaaaaatttttaatatagtcttatctac 
cacacatccccatagatttaaggattttaatagaaagtcatgatgtatgtatttaagcca 
cgttaaaagaaaaaatataactatggaccggtattcagtgaatacagtttcatggttttt 
aattctttcaaagcacattaaaaatggtgtgctgataaaccccaagtaaattaacccttt 
ttccgtataaatccattttttgttttgaagaggggaaattatatttattgttgtttactg 
aatcctggtgtgaaagcatatcagatatgtatgaactgctactgctgtacttccgattta 
cggacatcattttattgctatttgtagacgtgataacatgaacatgagtacctatttatg 
tgggccttcagtggatgggcagtgccactcaggtctctggggtttccctctctaatttta 
agtaaattgacatataactactatgcttataaaaatgaagtaaggaaaacaagtagtcct 
gtttgccactaaaaacattttcaaaggaaaaataaaatgaaagtactttt tact ttt tat 
gatactcagaaattaggatgaagaacttttaaaattgctgaagatcaaagaggttatctc 
tgccagtcacaagtgtggctggtgtcattctgggtctgactggagccctcctggactgtt 
tctttaatttcaaaagccctgcagacatagtacctggtcagaactatgcctcggtttatt 
tatcattttgaaataaaatcaaaatttcaacctgtaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaa 

>bc002457 (TOXMARKER Assignment: 95; SEQ ID NO: 160) 

ggcacgaggccgccttctgcatcgcggcttcggcggcttccacctagacacctaacagtc 

gcggagccggccgcgtcgtgagggggtcggcacggggagtcgggcggtcttgtgcatctt 

ggctacctgtgggtcgaagatgtcggacatcggagactggttcaggagcatcccggcgat 

cacgcgctattggttcgccgccaccgtcgccgtgcccttggtcggcaaactcggcctcat 

cagcccggcctacctcttcctctggcccgaagccttcctttatcgctttcagatttggag 

gccaatcactgccaccttttatttccctgtgggtccaggaactggatttctttatttggt 

caatttatatttcttatatcagtattctacgcgacttgaaacaggagcttttgatgggag 

gccagcagactatttattcatgctcctctttaactggatttgcatcgtgattactggctt 

agcaatggatatgcagttgctgatgattcctctgatcatgtcagtactttatgtctgggc 

ccagctgaacagagacatgattgtatcattttggtttggaacacgatttaaggcctgcta 

tttaccctgggttatccttggattcaactatatcatcggaggctcggtaatcaatgagct 

tattggaaatctggttggacatctttattttttcctaatgttcagatacccaatggactt 

gggaggaagaaattttctatccacacctcagtttttgtaccgctggctgcccagtaggag 

aggaggagtatcaggatttggtgtgccccctgctagcatgaggcgagctgctgatcagaa 

tggcggaggcgggagacacaactggggccagggctttcgacttggagaccagtgaagggg 

cggcctcgggcagccgctcctctcaagccacatttcctcccagtgctgggtgcgcttaac 

aactgcgttctggctaacactgttggacctgacccacactgaatgtagtctttcagtacg 

agacaaagtttcttaaatcccgaagaaaaatataagtgttccacaagtttcacgattctc 

attcaagtccttactgctgtgaagaacaaataccaactgtgcaaattgcaaaactgaaaa 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

aaa 

>af 161504 (TOXMARKER Assignment: 97; SEQ ID NO: 154) 

ggtgccttatgcggtgattttaatgataggtgtcatatataggacggagtaatctgttta 

cattctgttcttctcgatgcactcacaagcgggtaactaggtgacaagaaaacaaagatc 

ttattcaaaagaggtcttacagcaacccaacgtctcatcttcccatagtaaagatgacgg 

cgccttgaggtaagctacaggcaacaccacttccgcgtttctcttgcgccctggtccaag 
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atggcggatgaagccacgcgacgtgttgtgtctgagatcccggtgctgaagactaacgcc 
ggaccccgagatcgtgagttgtgggtgcagcgactgaaggaggaatatcagtcccttatc 
cggtatgtggagaacaacaagaatgctgacaacgattggttccgactggagtccaacaag 
gaaggaactcggtggtttggaaaatgctggtatatccatgacctcctgaaatatgagttt 
gacatcgagtttgacattcctatcacatatcctactactgccccagaaattgcagttcct 
gagctggatggaaagacagcaaagatgtacaggggtggcaaaatatgcctgacggatcat 
ttcaaacctttgtgggccaggaatgtgcccaaatttggactagctcatctcatggctctg 
gggctgggtccatggctggcagtggaaatccctgatctgattcagaagggcgtcatccac 
cacaaagagaaatgcaaccaatgaagaatcaagccactgaggcagggcagagggaccttt 
gataggctacgatactattttcctgtgcatcacacttaactcatctaactgcttccccgg 
acaccctccacctctagttgttactaagtagctgcagtaggcattgctggggaagaaaca 
aacacacaccaaacagtactgctacttagtttctaaggctgcacagggaagggaaagact 
gggctttggacaatctagaggtaatttatatccgcccccaggtggagcaacatgcgattc 
tggaggcacgggggtaactgaaagtgagtacatatagtctttctggtttctggagataac 
ccatcaataaaagctgcttcctctggtaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

>af 085361 (TOXMARKER Assignment: 60; SEQ ID NO: 146) 

ggtcgcttggtggctccgtctgttgtccgtccgcccgcgggtgccatcatggcggacgcg 

gccagtcaggtgctcctgggctccggtctcaccatcctgtcccagccgctcatgtacgtg 

aaagtgctcatccaggtgggatatgagcctcttcctccaacaataggacgaaatattttt 

gggcggcaagtgtgtcagcttcctggtctctttagttatgctcagcacattgccagtatc 

gatgggaggcgcgggttgttcacaggcttaactccaagactgtgttcgggagtccttgga 

actgtggtccatggtaaagttttacagcattaccaggagagtgacaagggtgaggagtta 

ggacctggaaatgtacagaaagaagtctcatcttcctttgaccacgttatcaaggagaca 

actcgagagatgatcgctcgttctgctgctaccctcatcacacatcccttccatgtgatc 

actctgagatctatggtacagttcattggcagagaatccaagtactgtggactttgtgat 

tccataataaccatctatcgggaagagggcattctaggatttttcgcgggtcttgttcct 

cgccttctaggtgacatcctttctttgtggctgtgtaactcactggcctacctcgtcaat 

acctatgcactggacagtggggtttctaccatgaatgaaatgaagagttattctcaagct 

gtcacaggattttttgcgagtatgttgacctatccctttgtgcttgtctccaatcttatg 

gctgtcaacaactgtggtcttgctggtggatgccctccttactccccaatatatacgtct 

tggatagactgttggtgcatgctacaaaaagaggggaatatgagccgaggaaatagctta 

tttttccggaaggtcccctttgggaagacttattgttgtgacctgaaaatgttaatttga 

agatgtggggcagggacagtgacatttctgtagtcccagatgcacagaattatgggagag 

aatgttgatttctatacagtgtggcgcgcttttttaataatcatttaatcttgggaaaat 

taaaaaaaaaaaaaaaaaaaaaaa 

>af 110646 (TOXMARKER Assignment: 54/ SEQ ID NO: 144) 

ccaagatggcggcggcagacacagcagcagcagccagtattcgggaaaggcagacagtgg 

ctttgaagcgtatgttgaatttcaatgtgcctcatattaaaaacagcacaggagaaccag 

tatggaaggtactcatttatgacagatttggccaagatataatctctcctctgctatctg 

tgaaggagctaagagacatgggaatcactctgcatctgcttttacactctgatcgagatc 

ctattccagatgttcctgcagtatactttgtaatgccaactgaagaaaatattgacagaa 

tgtgccaggatcttcgaaatcaactatatgaatcatattatttaaattttatttctgcta 

tttcaagaagtaaactggaagatattgcaaatgcagcgttagcagctagtgcagtaacac 

aagtagccaaggtttttgaccaatatctcaattttattactttggaagatgatatgtttg 

tattatgtaatcaaaataaggagcttgtttcatatcgtgccattaacaggccagatatca 

cagacacggaaatggaaactgttatggacactatagttgacagcctcttctgcttttatg 

gtactcggggtgatgttcctataatcagatgttcaagaggaacagcagcagaaatggtag 

cagtgaaactagacaagaaacttcgagaaaatctaagagatgcaagaaacagtcttttta 

caggtgatacacttggagctggccaattcagcttccagaggcccttattagtccttgttg 

acagaaacatagatttggcaactcctttacatcatacttggacatatcaagcattggtgc 

acgatgtactggatttccatttaaacagggttaatttggaagaatcttcaggagtggaaa 

actctccagctggtgctagaccaaagagaaaaaacaagaagtcttatgatttaactccgg 

ttgataaattttggcaaaaacataaaggaagtccattcccagaagttgcagaatcagttc 

agcaagaactagaatcttacagagcacaggaagatgaggtcaaacgacttaaaagcatta 

tgggactagaaggggaagatgaaggagccataagtatgctttctgacaataccgctaagc 

taacat cage tgttagttctttgccagaactccttgagaaaaaaagacttattgatc tec 

atacaaatgttgccactgctgttttagaacatataaaggcaagaaaattggatgtatatt 

ttgaatatgaagaaaaaataatgagcaaaactactctggataaatctcttctagatataa 

tatcagaccctgatgcaggaactccagaagataacatgaggttgtttcttatctattata 
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taagcacacagcaagcaccttctgaggctgatttggagcaatataaaaaagctttaactg 
atgcaggaaaccttaatcctttacaatatatcaaacagtggaaggcttttaccaagatgg 
cctcagctccggccagctatggcagcactaccactaaaccaatgggtcttttatcacgag 
tcatgaatacaggatcacagtttgtgatggaaggagtgaagaacctggttttgaaacagc 
5 aaaatctacctgttactcgtattttggacaatcttatggagatgaagtcaaaccccgaaa 
ctgatgactatagatattttgatcccaaaatgctgcggggcaatgacagctcagttccca 
gaaataaaaatccattccaagaggccattgtttttgtggtgggaggaggcaactacattg 
aatatcagaatcttgttgactacataaaggggaaacaaggcaaacacattttatatggct 
gcagtgagctttttaatgctacacagttcataaaacagttgtcacaacttggacaaaagt 
10 aacacagaagaaccttactatgataatctacttggaatgtggataaatgtaaaaagaaga 
aaagttagaagagcaatatgtttccttctctgtaacagtgtcctaacagtgaaaatcaga 
gttatttgttaatttttaaggaaattatatacttaatatgtattgattaaaagaaacatt 
tcagaaataaaatttcaacattgaaaaaaaaaaaaaaaa 

15 >bc015110 (TOXMARKER Assignment: 66; SEQ ID NO: 151) 

ggggagctctgcgaggggccggagcgcggcggagccatgcagtacccgcaccccgggccg 
gcggcgggcgccgtgggggtgccgctgtacgcgcccacgccgctgctgcaacccgcacac 
ccgacgcccttttacatcgaggacatcctgggccgcgggcccgccgcgcccacgcccgcc 
cccacgctgccgtcccccaactcctccttcaccagcctcgtgtccccctaccggaccccg 

20 gtgtacgagcccacgccgatccatccagccttctcgcaccactccgccgccgcgctggcc 
gctgcctacggacccggcggcttcgggggccctctgtaccccttcccgcggacggtgaac 
gactacacgcacgccctgctccgccacgaccccctgggcaaacctctactctggagcccc 
ttcttgcagaggcctctgcataaaaggaaaggcggccaggtgagattctccaacgaccag 
accatcgagctggagaagaaattcgagacgcagaaatatctctctccgcccgagaggaag 

25 cgtctggccaagatgctgcagctcagcgagagacaggtcaaaacctggtttcagaatcga 
cgcgctaaatggaggagactaaaacaggagaaccctcaaagcaataaaaaagaagaactg 
gaaagtttggacagttcctgtgatcagaggcaagatttgcccagtgaacagaataaaggt 
gcttctttggatagctctcaatgttcgccctcccctgcctcccaggaagaccttgaatca 
gagatttcagaggattctgatcaggaagtggacattgagggcgataaaagctattttaat 

30 gctggatgatgaccactggcattggcatgttcagaaaactggatttaggaataatgtttt 
gctacagaaaatcttcatagaagaactggaaggctatataagaaagggaatcaattctct 
ggtattctggaaacctaaaaatatttggtgcactgctcaattaacaaacctacatggaga 
ccttaattttgacttaacaaatagtttatgtactgctcttaggttgttttgataaagtga 
cattatagtgattaaattcttccccctttaaaaaaacagttagtggttttcactatttat 

35 aaaaaattaattttgaactttttgttaaatttttaagttatagctttaaaggttttaata 
ggaccttcttgaacgacttttctgtaatctgtttatctcccacttaatggaaaggcaaag 
gggtaccccaaatccagaggtgcctacatttcaggcagccttggagtattttaaaaggaa 
aacattctttacttttatatgacattcttatactgctgtctcaaatccaaaaacatttca 
gagctcttgtctcagagatgtgtgttctttttgtcagagatatggttgatgagaatctta 

40 aatgcttgttttgcactatcacttagtacctgtttgaccaaggtgttaaggggatagtac 
ctcccaattcaagcagagaaactgacctgactaaagttaatcgcagatgaactagaagtc 
acaggttaattaaatgtaagtagattgtagatactgttttatatcaaacaatgtttataa 
tgtgtatatagaattgttcactgtaaaaaaaatggccaaaatgtgttttttttttaataa 
gtaacttgactataaaataaagccgtccgtgggacgactgacctcgtaaaaaaaaaaaaa 

45 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaa 

>nm_004046 (TOXMARKER Assignment: 39; SEQ ID NO: 132) 
gtcccagtcagtccggaggctgcggctgcagaagtaccgctgcggagtaactgcaaagat 

50 gctgtccgtgcgcgttgctgcggccgtggtccgcgcccttcctcggcgggccggactggt 
ctccagaaatgctttgggttcatctttcattgctgcaaggaacttccatgcctctaacac 
tcatcttcaaaagactgggactgctgagatgtcctctattcttgaagagcgtattcttgg 
agctgatacctctgttgatcttgaagaaactgggcgtgtcttaagtattggtgatggtat 
tgcccgcgtacatgggctgaggaatgttcaagcagaagaaatggtagagttttcttcagg 

55 cttaaagggtatgtccttgaacttggaacctgacaatgttggtgttgtcgtgtttggaaa 
tgataaactaattaaggaaggagatatagtgaagaggacaggagccattgtggacgttcc 
agttggtgaggagctgttgggtcgtgtagttgatgcccttggtaatgctattgatggaaa 
gggtccaattggttccaagacgcgtaggcgagttggtctgaaagcccccggtatcattcc 
tcgaatttcagtgcgggaaccaatgcagactggcattaaggctgtggatagcttggtgcc 

60 aattggtcgtggtcagcgtgaactgattattggtgaccgacagactgggaaaacctcaat 
tgctattgacacaatcattaaccagaaacgtttcaatgatggatctgatgaaaagaagaa 
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gctgtactgtatttatgttgctattggtcaaaagagatccactgttgcccagttggtgaa 
gagacttacagatgcagatgccatgaagtacaccattgtggtgtcggctacggcctcgga 
tgctgccccacttcagtacctggctcct tact ctggctgttccatgggagagtattt tag 
agacaatggcaaacatgctttgatcatctatgacgacttatccaaacaggctgttgctta 
5 ccgtcagatgtctctgttgctccgccgaccccctggtcgtgaggcctatcctggtgatgt 
gttctacctacactcccggttgctggagagagcagccaaaatgaacgatgcttttggtgg 
tggctccttgactgctttgccagtcatagaaacacaggctggtgatgtgtctgcttacat 
tccaacaaatgtcatttccatcactgacggacagatcttcttggaaacagaattgttcta 
caaaggtatccgccctgcaattaacgttggtctgtctgtatctcgtgtcggatccgctgc 

10 ccaaaccagggctatgaagcaggtagcaggtaccatgaagctggaattggctcagtatcg 
tgaggttgctgcttttgcccagttcggttctgacctcgatgctgccactcaacaactttt 
gagtcgtggcgtgcgtctaactgagttgctgaagcaaggacagtattctcccatggctat 
tgaagaacaagtggctgttatctatgcgggtgtaaggggatatcttgataaactggagcc 
cagcaagattacaaagtttgagaatgctttcttgtctcatgtcgtcagccagcaccaagc 

15 cttgttgggcactatcagggctgatggaaagatctcagaacaatcagatgcaaagctgaa 
agagattgtaacaaatttcttggctggatttgaagcttaaactcctgtggattcacatca 
aataccagttcagttttgtcattgttctagtaaattagttccatttgtaaaagggttact 
ctcatactccttatgtacagaaatcacatgaaaaataaaggttccataatgcaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaa 

20 

>s52379 (TOXMARKER Assignment: 58; SEQ ID NO: 147) 
atggatcccaactgctcctgcgccgccggtgactcctgcacctgcgccggctcctgcaaa 
tgcaaagagtgcaaatgcacctcctgcaagaaaagctgctgctcctgctgccctgtgggc 
tgtgccaagtgtgcccagggctgcatctgcaaaggggcgtcggacaagtgcagctgctgc 
25 gcctga 

>k03193 (TOXMARKER Assignment: 40; SEQ ID NO: 136) 

gccccggcgccgccgccgcccagaccggacgacaggccacctcgtcggcgtccgcccgag 

tccccgcctcgccgccaacgccacaaccaccgcgcacggccccctgactccgtccagtat 

30 tgatcgggagagccggagcgagctcttcggggagcagcgatgcgaccctccgggacggcc 
ggggcagcgctcctggcgctgctggctgcgctctgcccggcgagtcgggctctggaggaa 
aagaaagtttgccaaggcacgagtaacaagctcacgcagttgggcacttttgaagatcat 
tttctcagcctccagaggatgttcaataactgtgaggtggtccttgggaatttggaaatt 
acctatgtgcagaggaattatgatctttccttcttaaagaccatccaggaggtggctggt 

35 tatgtcctcattgccctcaacacagtggagcgaattcctttggaaaacctgcagatcatc 
agaggaaatatgtactacgaaaattcctatgccttagcagtcttatctaactatgatgca 
aataaaaccggactgaaggagctgcccatgagaaatttacaggaaatcctgcatggcgcc 
gtgcggttcagcaacaaccctgccctgtgcaacgtggagagcatccagtggcgggacata 
gtcagcagtgactttctcagcaacatgtcgatggacttccagaaccacctgggcagctgc 

40 caaaagtgtgatccaagctgtcccaatgggagctgctggggtgcaggagaggagaactgc 
cagaaactgaccaaaatcatctgtgcccagcagtgctccgggcgctgccgtggcaagtcc 
cccagtgactgctgccacaaccagtgtgctgcaggctgcacaggcccccgggagagcgac 
tgcctggtctgccgcaaattccgagacgaagccacgtgcaaggacacctgccccccactc 
atgctctacaaccccaccacgtaccagatggatgtgaaccccgagggcaaatacagcttt 

45 ggtgccacctgcgtgaagaagtgtccccgtaattatgtggtgacagatcacggctcgtgc 
gtccgagcctgtggggccgacagctatgagatggaggaagacggcgtccgcaagtgtaag 
aagtgcgaagggccttgccgcaaagtgtgtaacggaataggtattggtgaatttaaagac 
tcactctccataaatgctacgaatattaaacacttcaaaaactgcacctccatcagtggc 
gatctccacatcctgccggtggcatttaggggtgactccttcacacatactcctcctctg 

50 gatccacaggaactggatattctgaaaaccgtaaaggaaatcacagggtttttgctgatt 
caggcttggcctgaaaacaggacggacctccatgcctttgagaacctagaaatcatacgc 
ggcaggaccaagcaacatggtcagttttctcttgcagtcgtcagcctgaacataacatcc 
ttgggattacgctccctcaaggagataagtgatggagatgtgataatttcaggaaacaaa 
aatttgtgctatgcaaatacaataaactggaaaaaactgtttgggacctccggtcagaaa 

55 accaaaattataagcaacagaggtgaaaacagctgcaaggccacaggccaggtctgccat 
gccttgtgctcccccgagggctgctggggcccggagcccagggactgcgtctcttgccgg 
aatgtcagccgaggcagggaatgcgtggacaagtgcaaccttctggagggtgagccaagg 
gagtttgtggagaactctgagtgcatacagtgccacccagagtgcctgcctcaggccatg 
aacatcacctgcacaggacggggaccagacaactgtatccagtgtgcccactacattgac 

60 ggcccccactgcgtcaagacctgcccggcaggagtcatgggagaaaacaacaccctggtc 
tggaagtacgcagacgccggccatgtgtgccacctgtgccatccaaactgcacctacgga 
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tgcactgggccaggtcttgaaggctgtccaacgaatggaagctacatagtgtctcacttt 
ccaagatcattctacaagatgtcagtgcactgaaacatgcaggggcgtgttgagtgtgga 
aggatcttgacaagttgttttgaagatagcattttgctaagtccctgaggtcactggtcc 
tcaaagcggcatggcgcatggcgtggctggttctgccacatgccagctgtgtgacctctg 
agactccacttcttccgtgctgaaaataaagaaggagttttactaaggaccaaacaagat 
aatgaatgtgaaactgctccatgaaccccaaagaattatgcacatagatgcgatcattaa 
gatgcgaagccatcgagttaccacctggcatgcttaaactgtaaagagtgggtcaaagta 
aactgaattggaaaatccaaagttatgcagaaaaacaataaaggagatagtaaaaagggt 
taacgagccagtccaggggaagcgaagaagacaaaaagagtccttttctgggccaagttt 
gataaattaggcctcccgaccctttgctctgttgctttatcaactctactcggcaataac 
aat 

>bc017832 (TOXMARKER Assignment: 41; SEQ ID NO: 137) 

gcccctccctccgcccgcccgccggcccgcccgtcagtctggcaggcaggcaggcaatcg 

gtccgagtggctgtcggctcttcagctctcccgctcggcgtcttccttcctcctcccggt 

cagcgtcggcggctgcaccggcggcggcgcagtccctgcgggaggggcgacaagagctga 

gcggcggccgccgagcgtcgagctcagcgcggcggaggcggcggcggcccggcagccaac 

atggcggcggcggcggcggcgggcgcgggcccggagatggtccgcgggcaggtgttcgac 

gtggggccgcgctacaccaacctctcgtacatcggcgagggcgcctacggcatggtgtgc 

tctgcttatgataatgtcaacaaagttcgagtagctatcaagaaaatcagcccctttgag 

caccagacctactgccagagaaccctgagggagataaaaatcttactgcgcttcagacat 

gagaacatcattggaatcaatgacattattcgagcaccaaccatcgagcaaatgaaagat 

gtatatatagtacaggacctcatggaaacagatctttacaagctcttgaagacacaacac 

ctcagcaatgaccatatctgctattttctctaccagatcctcagagggttaaaatatatc 

cattcagctaacgttctgcaccgtgacctcaagccttccaacctgctgctcaacaccacc 

tgtgatctcaagatctgtgactttggcctggcccgtgttgcagatccagaccatgatcac 

acagggttcctgacagaatatgtggccacacgttggtacagggctccagaaattatgttg 

aattccaagggctacaccaagtccattgatatttggtctgtaggctgcattctggcagaa 

atgctttctaacaggcccatctttccagggaagcattatcttgaccagctgaaccacatt 

ttgggtattcttggatccccatcacaagaagacctgaattgtataataaatttaaaagct 

aggaactatttgctttctcttccacacaaaaataaggtgccatggaacaggctgttccca 

aatgctgactccaaagctctggacttattggacaaaatgttgacattcaacccacacaag 

aggattgaagtagaacaggctctggcccacccatatctggagcagtattacgacccgagt 

gacgagcccatcgccgaagcaccattcaagttcgacatggaattggatgacttgcctaag 

gaaaagctcaaagaactaatttttgaagagactgctagattccagccaggatacagatct 

taaatttgtcaggtacctggagtttaatacagtgagctctagcaagggaggcgctgcctt 

ttgtttctagaatattatgttcctcaaggtccattattttgtattcttttccaagctcct 

tattggaaggtatttttttaaatttagaattaaaaattatttagaaaaaaaaaaaaaaaa 

aaaaaaaaaaaaaa 

>ah002741 (TOXMARKER Assignment: 98; SEQ ID NO: 161) 



>aw2 934 99 (TOXMARKER Assignment: 51; SEQ ID NO: 142) 

tttttttttttttttttgccggtcggagactcccgtctgccaaggtttttattgtggtcc 

cgcggggcaggaggtatgcatggcatacgtaagcagagagccggaggcagccatcggcac 

ctagaacggtgcagagttggcccaggagcgtggcggggcaggcggcctgcacctgccctg 

ctcgcccagcagaccctccgggctccagcctggcggggcccagcgtccaccttggtgggc 

ccaggtcagatcttggccagggtggagtgggcgtcggcctgctcctcttggatgggggtc 

cggaaactgcctcccccagggggcttgtgggcatcggggggcagcctctggtccctccgg 

agcaggtacagggccagcaggatgggcagggggcccagcagccccagcaccaggcccagg 

cccaggatggggggaaccgcacgggccccggggacctccacgggccgggtg 

>aj3 00837 (TOXMARKER Assignment: 46; SEQ ID NO: 13 8) 

ggcacgagggtggcctctgcggctaggccggctcgagactcccgggcgccgaggcgctgc 

cgcccgcctcgccgccccacgccgaaggaccacgcgcccgccgccgccagcctctcagcg 

ctcccatgatcgcccggtgccttttggctgtgcgaagcctccgcagagttggtggttcca 

ggattttactcagaatgacgttaggaagagaagtgatgtctcctcttcaggcaatgtctt 

cctatactgtggctggcagaaatgttttaagatgggatctttcaccagagcaaattaaaa 

caagaactgaggagctcattgtgcagaccaaacaggtgtacgatgctgttggaatgctcg 

gtattgaggaagtaacttacgagaactgtctgcaggcactggcagatgtagaagtaaagt 
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atatagtggaaaggaccatgctagactttccccagcatgtatcctctgacaaagaagtac 
gagcagcaagtacagaagcagacaaaagactttctcgttttgatattgagatgagcatga 
gaggagatatatttgagagaattgttcatttacaggaaacctgtgatctggggaagataa 
aacctgaggccagacgatacttggaaaagtcaattaaaatggggaaaagaaatgggctcc 
5 atcttcctgaacaagtacagaatgaaatcaaatcaatgaagaaaagaatgagtgagctat 
gtattgattttaacaaaaacctcaatgaggatgataccttccttgtattttccaaggctg 
aacttggtgctcttcctgatgatttcattgacagtttagaaaagacagatgatgacaagt 
ataaaattaccttaaaatatccacactatttccctgtcatgaagaaatgttgtatccctg 
aaaccagaagaaggatggaaatggcttttaatacaaggtgcaaagaggaaaacaccataa 

10 ttttgcagcagctactcccactgcgaaccaaggtggccaaactactcggttatagcacac 
atgctgacttcgtccttgaaatgaacactgcaaagagcacaagccgcgtaacagcctttc 
tagatgatttaagccagaagttaaaacccttgggtgaagcagaacgagagtttattttga 
atttgaagaaaaaggaatgcaaagacaggggttttgaatatgatgggaaaatcaatgcct 
gggatctatattactacatgactcagacagaggaactcaagtattccatagaccaagagt 

15 tcctcaaggaatacttcccaattgaggtggtcactgaaggcttgctgaacacctaccagg 
agttgttgggactttcatttgaacaaatgacagatgctcatgtttggaacaagagtgtta 
cactttatactgtgaaggataaagctacaggagaagtattgggacagttctatttggacc 
tctatccaagggaaggaaaatacaatcatgcggcctgcttcggtctccagcctggctgcc 
ttctgcctgatggaagccggatgatggcagtggctgccctcgtggtgaacttctcacagc 

20 cagtggcaggtcgtccctctctcctgagacacgacgaggtgaggacttactttcatgagt 
ttggtcacgtgatgcatcagatttgtgcacagactgattttgcacgatttagcggaacaa 
a tgtggaaactgactttgtagaggtgccatcgcaaatgcttgaaaattgggtgtgggacg 
tcgattccctccgaagattgtcaaaacattataaagatggaagccctattgcagacgatc 
tgcttgaaaaacttgttgcttctaggctggtcaacacaggtcttctgaccctgcgccaga 

25 ttgttttgagcaaagttgatcagtctcttcataccaacacatcgctggatgctgcaagtg 
aatatgccaaatactgctcagaaatattaggagttgcagctactccaggcacaaatatgc 
cagctacctttggacatttggcagggggatacgatggccaatattatggatatctttgga 
gtgaagtattttccatggatatgttttacagctgttttaaaaaagaagggataatgaatc 
cagaggttggaatgaaatacagaaacctaatcctgaaacctgggggatctctggacggca 

30 tggacatgctccacaatttcttgaaacgtgagccaaaccaaaaagcgttcctaatgagta 
gaggcctgcatgctccgtgaactggggatctttggtagccgtccatgtctggaggacaag 
tcgacatcaccatgtgttactggcctggaaactgaagggagttttgcaagtgaaaattta 
gatttctattgacatccttttgttttctaattttaaaaattataaagatgtaaatggaat 
tataaatactgtgacctaagaaaagacccactagaaagtaattgtactataaaatttcat 

35 aaaactggatttgatttctttttatgaaagtttcatatgaatgtaacttgattttttact 
attataatctagataatatgatataagagggctaagaatttttaaattgaatcatatata 
tgatataatttgatccttcttgtatcttgaagttttgtacttgggatttctggactgata 
aatgaatcatcacattcttctggtaaatattttcttggagctctgtgtcaactttgatcc 
tttgtctcccaggaaggtgtgacctctcctttgcctgcatacctcaaggccaggggaata 

40 tgcctcagtgatgcatttatctttgtatatcaggccgcatgattcccaactttctgccac 
acttaaattacgttcctccatttcagttttgtcttttctgtctaaagttcagtcaaagag 
tatcaaaaaa 

>aj242973 (TOXMARKER Assignment: 49; SEQ ID NO: 140) 

45 gcggccgcgtcgacgtgacagccggtacgcccgggtttgggcaacctcgattacgggcgg 
cctccaggcccgccagcagcgccccgcgccgcccgcccgcgcccctgccgccccccggtt 
ccggccgcggaccccactctctgccgttccggctgcggctccgctgccggtagcgccgtc 
ccccgggaccacccttcggctggcgccctcccatgctctcggccacccggagggcttgcc 
agctcctcctcctccacagcctctttcccgtcccgaggatgggcaactcggcctcgaaca 

50 tcgtcagcccccaggaggccttgccgggccggaaggaacagacccctgtagcggccaaac 
atcatgtcaatggcaacagaacagtcgaacctttcccagagggaacacagatggctgtat 
ttggaatgggatgtttctggggagctgaaaggaaattctgggtcttgaaaggagtgtatt 
caactcaagttggttttgcaggaggctatacttcaaatcctacttataaagaagtctgct 
cagaaaaaactggccatgcagaagtcgtccgagtggtgtaccagccagaacacatgagtt 

55 ttgaggaactgctcaaggtcttctgggagaatcacgacccgacccaaggtatgcgccagg 
ggaacgaccatggcactcagtaccgctcggccatctacccgacctctgccaagcaaatgg 
aggcagccctgagctccaaagagaactaccaaaaggttctttcagagcacggcttcggcc 
ccatcactaccgacatccgggagggacagactttctactatgcggaagactaccaccagc 
agtacctgagcaagaaccccaatggctactgcggccttgggggcaccggcgtgtcctgcc 

60 cagtgggtattaaaaaataattgctccccacatggtgggcctttgaggttccagtaaaaa 
tgctttcaacaaattgggcaatgcttgtgtgattcacaatcgtggcatttaaagtgcaca 
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aagtacaaaggaatttatacagattgggtttaccgaagtataatctataggaggcgcgat 
ggcaagttgataaaatgtgacttatctcctaataagttatggtgggagtggagctgtgcg 
gtttcctgtgtcttctggggtctgagtgaagatagcagggatgctgtgttcacccttctt 
ggtagaagctaaggtgtgagctgggaggttgctggacaggatgggggaccccagaagtcc 
tttatctgtgctctctgcccgccagtgccttacaatttgcaaacgtgtatagcctcagtg 
actcattcgctgaaatccttcgctttacca 

>aj002030.1 (TOXMARKER Assignment: 126; SEQ ID NO: 166) 

ccggtgatggcggctggtgatggggacgtgaagctaggcaccctggggagtggcagcgag 

agcagcaacgacggcggcagcgagagtccaggcgacgcgggagcggcagcggaaggggga 

ggctgggcggcggcggcgttggcgcttctgacggggggcggggaaatgctgctgaacgtg 

gcgctggtggctctggtgctgctgggggcctaccggctgtgggtgcgctgggggcggcgg 

ggtctgggggccggggccggggcgggcgaggagagccccgccacctctctgcctcgcatg 

aagaagcgggacttcagcttggagcagctgcgccagtacgacggctcccgcaacccgcgc 

atcctgctcgcggtcaatgggaaagtcttcgacgtgaccaaaggcagcaagttctacggc 

ccggcgggtccatatggaatatttgctggtagggatgcctccagaggactggccacattt 

tgcctagataaagatgcacttagagatgaatatgatgatctctcagatttgaatgcagta 

caaatggagagtgttcgagaatgggaaatgcagtttaaagaaaaatatgattatgtaggc 

agactcctaaaaccaggagaagaaccatcagaatatacagatgaagaagataccaaggat 

cacaataaacaggattgaactttgtaaacaaccaaagtcaggggccttcagaactgcaat 

tcttactccctttcacagactgtccggagtctttgggtttgattcacctgctgcgaaaaa 

cattcaacaaattgtgtacaagataaattaatctcactatgaagatttgaataactagac 

attatttatgctgccaaactcatttgttgcagttgtttgtaatgtctagtggggcttcat 

catcctgaaaagaaggagacagggatttttttaaagagcaagaaagtcacaatattactt 

ctttccttccttttttccttctttcctttcttctttctctttctttctttttaaaatata 

ttgaagacaaccagatatgtatttgctactcaagtgtacagatctcctcaagaaacatca 

agggactcctgtgtcacatactgtgtttttattttaacatgggtgagggaggcgacctga 

tcaggggaggtgggggtacacatcaatttgagttgttcaggctactgaaacattaaaatg 

tgaattcccaaacttttctttttggctttgtcagggaaaagaaaaatatctttataaaga 

aatctttggaaattaggagaaggaatttcaggtgggtttaagtcagagctagttccccaa 

cagaaagatcatttgaaaccagtttttatcccttctctttccttccctttccctaaatca 

aatcaatattaattgtgccttatttcacttaacatagacttgaattatttttagggaaag 

cccctataatgaattcagaaatcactacaagcagcattaagactgaagttggaatattct 

gttgaccataaaaccttgatatcattctgtgtatatagaatgtaaaaggaatattacagt 

gttaactgccatatatgtaatatacacaaactcaattagcattgtaatggccaaatgcat 

tcccccatgcttttctgttttcaaaaaaattgaaaaacaaatcaactcttatccccaaca 

gctgcctaattttaggagtctgaccctccacatctcactggtgtgggtgcatggggctgt 

ggagtgggtgtcagtatggatgtgtctgaatgtgtgaggccttggaagggactctttctg 

cagatactgtaaatacaagtaccattttaataaagcatgtacaataaaccaaaaaaaaaa 

aaaaaaaaaaaaaa 

>ak056165.1 (TOXMARKER Assignment: 110; SEQ ID NO: 167) 

ggactcgagcgctccgattggagttagggcctgcttgtctgcgtgctgcgaagtccgcgg 

ctgcccccggggccctagtcgttgggttccagggtccttcacgttccattcccaggctgg 

tctgagctccggggccgtggtcccgctgcctcctccggtcgtcgtgcggaagctgcgacg 

caggcagaccatggcagagttctcccagaaacgggggaagcggcgtagcgacgaagggct 

gggcagcatggtggacttcctcctggccaatgcccgcctggtgctgggcgtgggcggggc 

tgctgtgctgggcattgccaccctggccgtgaagcggttcattgacagggccactagccc 

gcgggatgaggatgacaccaaggcagacagctggaaggaactgagcctgctcaaggccac 

accacacctgcagccccggcctccacctgctgcccttagccagccagtgttgcccttggc 

cccctcgtcgtctgccccagaagggcctgcagaaactgatcctgaggtgacaccacagct 

cagctccccagcaccgctgtgtctgacactgcaggagaggctgctggccttcgagcggga 

ccgtgtgaccatcccagcagcccaggtggctttggccaaacagctggctggcgacatcgc 

cctggagctgcaggcctactttcggagcaagttcccggaactgccctttggggcattcgt 

gcctggggggccgctctacgacgggctgcaggcgggggctgcggaccatgtgcgtctcct 

ggtgccactggtgctggagccgggcctgtggagcctggtgccgggcgtggacactgtggc 

gagggaccctcgctgctgggccgtgcgcaggacgcagcttgagttctgcccccgtgggag 

cagcccctgggaccgcttcctggtcgggggctacttctcctcccgcgtcctgctggagct 

actccgcaaggcgctggctgcttctgtcaactggccggccattggcagccttctcgggtg 

cctgatccggcccagcatggcctcggaggagctgctgctcgaggtgcagcacgaacgcct 

ggagctcactgtggctgtgcttgtggcagtccctggggtcgatgctgacgaccgcctcct 
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cttggcctggcccctggaggggctggcggggaacctctggctgcaggacctgtatccagt 
ggaggctgctaggctgcgagccctggacgaccatgacgctgggactcgccggcggctgct 
gctgctgctgtgtgctgtctgccgtggttgctcggctctggggcagctaggccggggtca 
cc tgacccaggtggtcctgcgtctgggggaggacaacgtggattggacggaggaggcctt 
gggtgagcgcttcctgcaagccctggagctgctcatcggcagcctggagcaggccagcct 
gccctgccacttcaaccccagcgtgaacctcttcagcagcttgcgtgaggaggagattga 
cgacattggctatgcgctatacagtggcctacaggagcccgaggggctgctctaggtggg 
tggaaacgggtggttgccatgttttctaatgctggggagctgcacccacctcccttccag 
ggatttgaatagtggtttttctctagctttttgccagaacaaaggagggtacattactta 
aacccagggcatcaggatgtgcttgggctatggtggccataaaccctgagcccagagagc 
ttgggtcactgtcacctgagtgcagctgggctgcctcaggcagcttggagtgccagccat 
tcctgcaagcaccgtttcagctcttggggccaaccccaggacctttggctctgtccatca 
ccagcaaccaatccaccaacagaatgtggtttctgccatcctgggcagaagctgaaggcc 
agcttcacatttctgctgagagaaggtgacttaacgccttttccggccctagctccaggc 
gttttgaggcgtctggtgcctgatggtaggtatggtgtgtttgttctgtcccccaggggc 
tggagtcacctggtgcccctgaaggacagatttttggctgttaaaggatggcattttcct 
gctgtcttctgtgcgtttagttttcttgctgagcgggagctcagtatgacttgccaccca 
cctgatacctcagggcaaggccctttttccctccagccaggtgagtgttttcttcaggca 
gctgagggtcctgggggagctgaggctctgtgctgcacccccagcccacagctggggcat 
ctcactggagctgttccaggccccactggagagcagaggacctgatcccccactagagag 
gtccggtgtgcacagccggcctcccagtgtgccaaaatgaactgctctcagctgatggct 
gtattctgactttgaagcctgttaagaggtagcaagggggctagaggagggagattccac 
ctcccctcccaagtgaccctcctcctgcctctggtatccttccttttgaaacgaagctca 
gcttcgaagatgtgaacaagaataaaaggaaaaaattctaatgtatatat 

>nm_005968 (TOXMARKER Assignment: 114; SEQ ID NO: 162) 

cgcgccgggaacagccagtcggtgcctaacgcgagtgtatctcgagagagaagcgatcaa 

cagctgccggtctgcgcctgcgcggcgacggggcgtggccgcgggcgagtggggccaagg 

aggcagccgggagcgggggcgcaggtgttactggttgcgtcgggtcacgtgggcgcgcag 

cagaccgcggtgcagcccgttcgctcacacaaagcccagacgcggagaaaatggcggcag 

gggtcgaagcggcggcggaggtggcggcgacggagatcaaaatggaggaagagagcggcg 

cgcccggcgtgccgagcggcaacggggctccgggccctaagggtgaaggagaacgacctg 

ctcagaatgagaagaggaaggagaaaaacataaaaagaggaggcaatcgctttgagccat 

atgccaatccaactaaaagatacagagccttcattacaaacataccttttgatgtgaaat 

ggcagtcacttaaagacctggttaaagaaaaagttggtgaggtaacatacgtggagctct 

taatggacgctgaaggaaagtcaaggggatgtgctgttgttgaattcaagatggaagaga 

gcatgaaaaaagctgcggaagtcctaaacaagcatagtctgagcggaagaccactgaaag 

tcaaagaagatcctgatggtgaacatgccaggagagcaatgcaaaaggtgatggctacga 

ctggtgggatgggtatgggaccaggtggcccaggaatgattactatcccacccagtatcc 

taaataatcccaacatcccaaatgagattatccatgcattacaggctggaagacttggaa 

gcacagtatttgtagcaaatctggattataaagttggctggaagaaactgaaggaagtat 

ttagtatggctggtgtggtggtccgagcagacattcttgaagataaagatggaaaaagtc 

gtggaataggcactgttacttttgaacagtccattgaagctgtgcaagctatatctatgt 

tcaatggccagctgctatttgatagaccaatgcacgtcaagatggatgagagggccttac 

caaaaggagatttcttccctcctgagcgtccacaacaacttccccatggccttggtggta 

ttggcatggggttaggaccaggagggcaacccattgatgccaatcacctgaataaaggca 

tcggaatgggaaacataggtcccgcaggaatgggaatggaaggcataggatttggaataa 

ataaaatgggaggaatggaggggccctttggtggtggtatggaaaacatgggtcgatttg 

gatctgggatgaacatgggcaggataaatgaaatcctaagtaatgcactgaagagaggag 

agatcattgcaaagcagggaggaggtggaggtggaggaagcgtccctgggatcgagagga 

tgggtcctggcattgaccgcctcgggggtgccggcatggagcgcatgggcgcgggcctgg 

gccacggcatggatcgcgtgggctccgagatcgagcgcatgggcctggtcatggaccgca 

tgggctccgtggagcgcatgggctccggcattgagcgcatgggcccgctgggcctcgacc 

acatggcctccagcattgagcgcatgggccagaccatggagcgcattggctctggcgtgg 

agcgcatgggtgccggcatgggcttcggccttgagcgcatggccgctcccatcgaccgtg 

tgggccagaccattgagcgcatgggctctggcgtggagcgcatgggccctgccatcgagc 

gcatgggcctgagcatggagcgcatggtgcccgcaggtatgggagctggcctggagcgca 

tgggccccgtgatggatcgcatggccaccggcctggagcgcatgggcgccaacaatctgg 

agcggatgggcctggagcgcatgggcgccaacagcctcgagcgcatgggcctggagcgca 

tgggtgccaacagcctcgagcgcatgggccccgccatgggcccggccctgggcgctggca 

ttgagcgcatgggcctggccatgggtggcggtggcggtgccagctttgaccgtgccatcg 
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agatggagcgtggcaacttcggaggaagcttcgcaggttcctttggtggagctggaggcc 
atgctcctggggtggccaggaaggcctgccagatatttgtgagaaatctgccattcgatt 
tcacatggaagatgctaaaggacaaattcaacgagtgcggccacgtgctgtacgccgaca 
tcaagatggagaatgggaagtccaaggggtgtggtgtggttaagttcgagtcgccagagg 
5 tggccgagagagcctgccggatgatgaatggcatgaagctgagtggccgagagattgacg 
ttcgaattgatagaaacgcttaagcagttgccttttttaaacatcgatacgagacctctg 
aatttgtattttttcttgttaaccattttaatttgttggctggatgtataaagatgttta 
aaaaattcagttgctttttggggtaatttgaattacttttttaatgactggggttccatt 
tgactgtttgcattgagattgcaatgtgcgcaattttttttgtagttgtggcatcttgtt 
10 gacatcgaatatgactttgataataaataccggttcctgaaaaaaaaaaaaaaaaaaaaa 
aaa 

>ak026121 (TOXMARKER Assignment: 44; SEQ ID NO: 138) 
agttgctgtggaggccctggcacggctgcagcagggtgtgagcgccaccgttgcccacct 

15 tctggacctggcaggcagcgccggtgcgactgggagctggcgtagcccctctgagccaca 
ggagccgctggtgcaggacctgcaggctgctgtggccgccgtccagagtgccgtccacga 
gctgttggagtttgcccgcagcgcggtgggcaatgctgcccacacatctgaccgtgccct 
gcatgccaagcttagccggcagctgcagaagatggaggacgtgcaccagacgctggtggc 
acatggtcaggccctcgacgctggccggggaggctctggagccacccttgaggacctgga 

20 ccggctggtggcctgctcgcgggctgtgcccgaggacgccaagcagctggcctccttcct 
gcacggcaatgcctcactgctcttcagacggaccaaggccactgccccggggcctgaggg 
gggtggcaccctgcaccccaaccccactgacaagaccagcagcatccagtcacgacccct 
gccctcaccccctaagttcacctcccaggactcgccagatgggcagtacgagaacagcga 
ggggggctggatggaggactatgactacgtccacctacaggggaaggaggagtttgagaa 

25 gacccagaaggagctgctggaaaagggcagcatcacgcggcagggcaagagccagctgga 
gttgcagcagctgaagcagtttgaacgactggaacaggaggtgtcacggcccatagacca 
cgacctggccaactggacgccagcccaacccctggccccggggcgaacaggcggcctggg 
gccctcggaccggcagctgctgctcttctacctggagcagtgtgaggccaacctgaccac 
actgaccaacgccgtggacgccttctttaccgccgtggccaccaaccagccgcccaagat 

30 ctttgtggcgcacagcaagttcgtcatcctcagcgcccacaagctggtgttcatcgggga 
cacactgtcacggcaggccaaggctgctgacgtgcgcagccaggtgacccactacagcaa 
cctgctgtgcgacctcctgcgcggcatcgtggccaccaccaaggccgctgccttgcagta 
cccatcgccttccgcggcccaggacatggtggagagggtcaaggagctgggccacagcac 
ccagcagttccgccgcgtcctaggccagctggcagccgcctgagggtggtgaccccagga 

35 gggaggcaggggaggggtgcggcggtcccagctccctggctcccatgtcaagagtcgctg 
tgccacaggcttagggacaggaccccagctctgcgtcggtcctggtgccctggatgccca 
ggaatctgtatatatttatggccgggcagggtgtggggccatgcctcctcaggagccgaa 
gcccaggggccggccagtggccttccccagcatgcaccacgggcccgggttgggtcacca 
gacggggctggagtgtgagggtcctgcagcctgcaggacctcgtgccaccccgagggctg 

40 agcctggtcccacgagggtgccgtgtcccctgacagggccagtgcagtttggtgtgtcct 
ccgcctttccaggagaagaacctgaagaactatttttcgttattggttttccaatcattt 
gactaagagtctccatttaaataaagtttttaaaaggaaaaaaaaaaaaaaaaa 

>u90545.1 (TOXMARKER Assignment: 119; SEQ ID NO: 152) 

45 acgcgtccgcccacgcgtccgcccacgcgtccggtcggggccagagcgcaggtgtacctg 
gcggccgtgctggagcacctgaccgccgagatcctggagctggctggcaacccggcccgc 
gacaagaagacccgcatcatcctgcgccacctgtagctggccattcgcaacggcgaggag 
cttaacaagctgctgggcgaagtcaccatcgcgcagggcggtgtcctgcccaacattcag 
ggcgtgcttctgccccagaagaccaagagccaccacaaggccaagggtgaaaaccattca 

50 ctaggagaggagaaacacaatggccaccaagacagagttgagtcccacagcaagggagag 
caagaacgcacaagatatgcaagtggatgagacactgatccccaggaaaggtccaagttt 
atgttctgctcgctatggaatagccctcgtcttacatttctgcaatttcacaacgatagc 
acaaaatgtcatcatgaacatcaccatggtagccatggtcaacagcacaagccctcaatc 
ccagctcaatgattcctctgaggtgctgcctgttgactcatttggtggcctaagtaaagc 

55 cccaaagagtcttcctgcaaagtcctcaatacttgggggtcagtttgcaatttgggaaaa 
gtggggccctccacaagaacgaagcagactctgcagcattgctttatcaggaatgttact 
gggatgctttactgccatcctcataggtggcttcattagtgaaacccttgggtggccctt 
tgtcttctatatctttggaggtgttggctgtgtctgctgccttctctggtttgttgtgat 
ttatgatgaccccttttcctatccatggataagcacctcagaaaaagaatacatcatatc 

60 ctccttgaaacaacaggtcgggtcttctaagcagcctcttcccatcaaagctatgctcag 
atctctacccatttggtccatatgtttaggctgtttcagccatcaatggttagttagcac 
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aatggttgtatacataccaacttacatcagctctgtgtaccatgttaacatcagagacaa 
tggacttctatctgcccttccttttattgttgcctgggtcataggcatggtgggaggcta 
tctggcagatttccttctaaccaaaaagtttagactcatcactgtgaggaaaattgccac 
aattttaggaagtctcccctcttcagcactcattgtgtctctgccttacctcaattccgg 
5 ctatatcacagcaactgccttgctgacgctctcttgcggattaagcacattgtgtcagtc 
agggatttatatcaatgtcttagatattgctccaaggtattccagttttctcatgggagc 
atcaagaggattttcgagcatagcacctgtcattgtacccactgtcagcggatttcttct 
tagtcaggaccctgagtttgggtggaggaatgtcttcttcttgctgtttgccgttaacct 
gttaggactactcttctacctcatatttggagaagcagatgtccaagaatgggctaaaga 
10 gagaaaactcactcgtttatgaagttatcccaccttggatggaaaagtcattaggcaccg 
tattgcataaaatagaaggcttccgtgatgaaaataccagtgaaaagatttttttttcct 
gtggctcttttcaattatgagatcagttcattattttattcagacttttttttgagagaa 
atgtaagatgaataaaaattcaaataaaatgataactaagaaaaaaaaaaaaaaa 

15 Example 10: Identification of a TOXMARKER 76, 135, 147, 151, 152, 154, and 162 

for Accurate Prediction of Hepatotoxcity 

In order to determine the minimal number of markers required for prediction, backward 
selection from a larger set of putative markers was used. Each set of smaller markers was 
evaluated on cross-validation by linear regression and the smallest set that was significantly 
20 accurate (pO.OOl, Fisher's exact test) was selected. The TOXMARKER genes that make up the 
TOXMARKER 76, 135, 147, 151, 152, 154, and 162 is listed in Table 6 below 



Table 6. TOXMARKER 76, 135, 147, 151, 152, 154, and 162 



PTS Code 


Rat ID No. 


TOXMARKE 
R 


SEQ ID 
NO 


Gene 
Name 


Accuracy 


P. Value 


pts2.3014511.1 


scr gb-bi294409 1 


42 


42 


IFNAR-2 


0.771552 


4.12E-13 


pts2.3013420.1 


scr gb-af069306 1 


59 


59 


Transaldo 
lase 






pts2.3015170.1 


scr gb-bi288503 1 


65 


65 


Clp-1 






pts2.301 1880.1 


scr gb-d86383 2 


66 


66 


Hex 






pts2.3015871.2 


scr gb-bm986259 1 


97 


97 


Novel 






pts2.3012511.2 


scr_gb-aa899865 3 


76 


76 


Novel 






pts2.3017180.2 


cszr 204152648 1915 
21095 


71 


71 


Novel 







25 

"P.value" is the probability level that the observed classification is random. 

"Accuracy" is the number of correct predictions divided by the number of samples (total 
number of predictions). It is a proportion of how often the disclosed TOXMARKER 76, 135, 
147, 151, 152, 154, and 162s are accurate in screening for toxicity. 

30 
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Other Embodiments 

It is to be understood that while the invention has been described in conjunction with the 
detailed description thereof, the foregoing description is intended to illustrate and not limit the 
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scope of the invention, which is defined by the scope of the appended claims. Other aspects, 
advantages, and modifications are within the scope of the following claims. 
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